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(54) CANCEROUS iWETASTASIS-ASSOCIATED GENE 

(57) A gene exhibiting stronger expression in 
mouse IMC-HM cells than in mouse IMG-Uyi cells and 
isolated by using the differential display method. When 
the relation between the expression of the isolated gene 
and cancerous metastasis is examined, it is found out 
that a transformant with the larger expression dose of 
the antisense mRNA to the gene shows the more low- 
ered cancerous metastasis ability. By using the isolated 
protein, cancerous metastasis inhibitors can be 
screened. 
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Description 

Technical Field 

5 [0001] The present Invention relates to a protein associated with cancer metastasis and a gene encoding the same. 
Background Art 

[0002] In spite o1 the remarkable development of methods of diagnosis and treatment, cancer is still one of the main 
10 causes of death in most developed countries, including Japan. In malignant cancer, cancer cells not only randomly pro- 
liferate but also invade (infiltrate) surrounding organs and rneiastasize into distant organs. In fact, the majority of deaths 
due to cancer are caused by its recurrence by metastasis. Therefore, it seems possible to completely cure cancer by 
the surgical excision of cancerous foci if the metastasis or invasion of cancer cells is effectively suppressed. Now that 
a method for treating localized cancer cells is about to be established, the most preferred subject for improving the treat- 
15 ment of cancer is now likely to shift to how to treat metastasis. Hence, analyses of genes associated with the invasion 
and metastasis of cancer cells and proteins as gene products thereof are extremely important for developing new can- 
cer treatment methods. 

[0003] Hitherto, a number of cancer metastasis-associated genes have been reported. For example, "nm23" was dis- 
covered by the differential hybridization method in mouse melanoma K-1735 subtypes exhibiting different colony form- 
ic ing abilities when they are transplanted through mouse tail vein to the lung. Its metastasis potency and expression in 
Northern blot are inversely correlated. Transfer of cDNA of ''nm23" into a highly metastatic cell clone has been reported 
to result in the reduction of experimental metastasis potency (cf. Steeg, P. S. et al., J. Natl. Cancer Inst., 80, 200-204 
(1988) and Leone,A. et al., 25-35(1991)). Also, a gene highly expressed in low metastatic cell lines was isolated by the 
differential hybridization metfiod performed with low and high metastatic cells derived from rat prostate cancer R-3327 
25 cell line; said gene was reported to encode fibronectin. In addition, a 4- to 8-fold reduction of the expression of mRNA 
of fibronectin was reported in many cell clones which became highly metastatic by allowing said clones to express the 
v-Ha-ras gene (cf. Schalken. J. A. et al., Cancer Res., 48, 2042-2046 (1988)). Furthermore, a number of genes includ- 
ing KAI1 (Dong, J. T et al.. Science, 268, 884-886 (1995)), stromelysin-3 (Basset, R et al.. Nature, 348, 699-704 
(1990)), pGI\/l21 (Phillips, S. M. et al., J. Natl. Cancer Inst., 82, 199-203 (1990)). etc. have been reported as cancer 
30 metastasis-associated genes. 

[0004] However, most of these reports were limited only to studies with the reduction of migration and invasion abilities 
of cancer cells, in which the expression of a target gene had been suppressed, as indicators. No report has appeared 
on the suppression effect actually demonstrated in the assay system using experimental animals. 

35 Disclosure of the Invention 

[0005] An objective of the present invention is to provide a novel protein associated with cancer metastasis and DNA 
encoding said protein. Another objective of this invention is to provide a vector carrying sab DNA, a transformant har- 
boring said vector, and a method for preparing a recombinant protein that comprises culturing said transformant Still 
40 another objective of this invention is to provide DNA used for detecting, isolating, and amplifying said gene, or for sup- 
pressing the expression thereof. A further objective of this invention is to provide a method of screening a cancer metas- 
tasis inhibitor using proteins and genes associated with cancer metastasis. 

[0006] Mouse IMC-HM cells, mutant cells isolated from mouse IMC cell line by Arakawa et al., are highly metastatic 
to the liver. When IMC-HM cells are sutxiutaneously transplanted to the venfral of mice, first they spontaneously metas- 

45 tasize to the liver, then rapidly metastasize to the main organs of the whole body, leading to the death of the host in 
about two weeks. In contrast, almost no metastasis potency is observed with IMC-LM cells, a sut>culture of the parental 
IMC cells. Mice transplanted with said cells survive for at least six weeks after the fa-ansplantation, and no metastatic 
foci are observed at all with the naked eye (cf. Arakawa, H. et al., Jap. J. Cancer Res.. 87. 518-523 (1996)). Although it 
may be readily assumed that these two cell lines share a very close genetic background, they widely differ in properties 

50 such as the presence or absence of spontaneous metastasis potency to distant organs. In view of these facts, the 
present inventors suspected the presence of a key gene controlling cancer metastasis in mouse IMC-HM cells, and 
attempted to isolate a strongly expressed gene from mouse IMC-HM cells as compared with mouse IMC-LM cells using 
the differential display metfiod. As a result, the present inventas succeeded in isolating two mouse cDNA fragments, 
which are candidates for ttie factor inducing cancer metastasis. Furthermore, they screened a cDNA library derived 

55 from mouse IMC-HAl Cells using these cDNA fragments as probes, and succeeded in isolating full-length cDNA 
thereof 

[0007] The present inventors also investigated the relation between the expression of Is lated mouse cDNAs and can- 
cer metastasis. Specifically, they constructed a vector expressing antisense-mRNA against one of the isolated mouse 
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cDNAs (mouse CMAP). and expressed said vector in IMC-HA1 cells derived from IMC-HM cells to prepare many trans- 
formants which exhibited different expression levels of the antisense mRNA. The inventors then transplanted the trans- 
formants into mice to investigate their cancer-metastasis abilities. As a result, it was found that the higher the antisense 
M RNA expression level of the transformant is, the lower its cancer metastasis potency is. The inventors also succeeded 

5 in isolating human cDNA (human CMAP) that exhibited a high homology with mouse CMAP by the polymerase chain 
reaction using primers prepared based on the nucleotide sequence of isolated mous CMAP. 
[0008] Since a close relation was recognized between the expression of isolated goies and cancer metastasis, the 
present inventors found it was possible to screen cancer metastasis inhibitors using these proteins and genes. 
[0009] The present invention relates to a novel protein and gene associated with cancer metastasis, and to a method 

10 of using them to screen cancer metastasis inhibitors. More specifically, the present invention relates to: 

(I) a protein comprising any one of the amino add sequences set forth in SEQ ID NOs: 4, 6, 9, or 38, or a protein 
comprising any one of said amino acid sequences having substitution, deletion, or addition of one or more amino 
acids and having cancer metastasis potency: 

IS (2) a protein encoded by DNA which hybridizes with DNA comprising any one of the nucleotide sequences set forth 
in SEQ ID NOs: 3, 5, 7, 8, or 37. said protein having cancer metastasis potency; 

(3) a DNA encoding the protein according to (1); 

(4) the DNA according to (3) comprising any one of the nucleotide sequences set forth in SEQ ID NOs: 3, 5, 7, 8, 
or 37; 

20 (5) a DNA hybridizing with a DNA comprising any one of the nucleotide sequences set forth in SEQ ID NOs: 3, 5, 
7, 8, or 37, encoding a protein having cancer metastasis potency; 

(6) a vector can-ying the DNA according to any one of (3) to (5); 

(7) a transformant hartwring the vector according to (6); 

(8) a method for preparing the protein according to (1) or (2), comprising culturing the transformant set forth in (7); 
25 (9) a DNA specifically hybridizing with the DNA according to any one of (3) to (5), comprising at least 15 nucleotide 

residues; 

(10) an antisense DNA against the DNA according to any one of (3) to (5) or a portion thereof; 

(I I) an antitxxly binding to the protein according to (1) or (2); 

(12) a method for screening a compound having cancer metastasis inhibitory ability, comprising steps of: 

30 

(a) contacting a test sample with the protein according to (1) or (2), and 

(b) selecting compounds having the activity to bind to the protein according to (1) or (2);, 

(13) a method for screening compounds having cancer metastasis inhibitory ability, conprising steps of: 

35 

(a) contacting lest samples with cells expressing the protein according to (1) or (2), 

(b) detecting the expression level of the protein according to (1) or (2) in cells contacted with test samples using 
the antibody set forth (11), and 

(c) selecting a compound which reduces the expression level of the protein according to (1) or (2) as compared 
40 with that in cells not contacted with test samples. 

[0010] The present Invention relates to a novel protein associated with cancer metastasis. The nucleotide sequence 
of cDNA isolated and designated "CMAP" by the present inventors Is shown In SEQ ID NO: 3, and the amino acid 
sequence encoded by said cDNA is shown in SEQ ID NO: 4. Mouse "CMAP" cDNA has been isolated by the differential 

45 display method as a gene which is highly expressed in a mouse cancer metastatic IMC-HM cell line derived from mouse 
IMC cells. Its expression was not detected in the IMC-U\1 cell line, which had been similarly derived from mouse IMC 
cell line but had almost no detectable cancer metastasis potency (Cf. Example 1). Studies on the expression of mouse 
CMAP in various cancer cells revealed that it was expressed in M-5076 cells, L51 78Y cells, P388 cells, and LI 21 0 cells, 
but not in IMC-HA1 cells, which formed metastasis fod mainly In the liver when transplanted through the tail vain of 

50 mice. In contrast, B-16-BL6 cells and Colon 25 cells, in which the expression of mouse CMAP was not detectable, 
strongly induced the experimental metastatic foci in the lung, but did not exhibit hepatic metastasis. All other cell lines 
in which mouse CMAP was expressed below the detection limit were almost completely rejected and regressed when 
they were transplanted to the tail vein (cf. Example 7). Many transfbrmants that expressed antisense mRNA against a 
transcript of mouse CMAP cDNA in various levels were prepared and their cancer metastasis abilities in vivo were 

55 examined by transplanting them into mice. As a result, it was found out that higher expression levels of antisense mRNA 
of transformant were related with lower cancer metastasis potendes of said transformant (cf. Example 9). These tacts 
indicate a dose association of mouse CMAP expression with cancer metastasis. 

[001 1 ] In addition, the present inventors isolated cDNA (human CMAP cDNA) exhibiting a high homology v/ith mouse 
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CMAP from human spleen by polymerase chain reaction using primers pr^^ared based on the nucleotide sequence of 
nwuse CMAP cDNA. This cDNA's nucleotide sequence is shown in SEQ ID NO: 37, and the amino acid sequence of 
the protan encoded by said cDNA is shown in SEQ ID NO: 38. Human CMAP had a homology as high as 71.6% with 
mouse CMAP in the amino acid sequence (Cf. Example 8). This fact strongly indicates that human CMAP functions 

5 similarly to mouse CMAP. 

[001 2] Furthermore, as in the case of mouse CMAP, the inventors isolated three cDNA clones (#7.1 641 1 , #8.323, and 
#11.24413) that were overexpressed in mouse IMC-HM cells, by the differential display method using mouse IMC-HM 
cells and IMC-LM cells. Nucleotide sequences of three cDNA clones are shown in SEQ ID NOs: 5, 7, and 8. Compari- 
son of these nucleotide sequences strongly indicated that '#7. 1 641 1 " and '#1 1 .2441 3' are produced when the #8.323 

10 is processed differently Clones "#7.16411" and "#11.24413" have an identical nucleotide sequence except for about 
1 GC bp at the 5" -end, and were presumed to encode proteins of 1 31 a.id 1 26 amino acids with 20 to 30 differe.it a.mino 
acids of the N-terminus. Amino acid sequences of proteins encoded by "#7.1 641 1 " cDNA and "#1 1 .2441 3" cDNA are 
shown in SEQ ID NOs: 6 and 9, respectively When the expression patterns of "#7. 1 641 1 " and "#1 1 .2441 3" in various 
cancerous and normal ceils were examined, "#7.1641 1 " was detected in almost all cancerous cells and normal tissues 

15 in spite of the difference in their expression level. However, "#1 1 .24413" was detected only in cancer cells such as IMC- 
HA1 cells derived from mouse IMC-HM cells, and P388 cells, and in normal tissues such as the thymus, lung, and 
spleen and slightly in the peripheral Wood (cf. Example 6). These results indicate that #8.323 to "#7.1641 1" are proc- 
essed in the majority of normal tissues, and "#1 1 .244 1 3" is assumed to be expressed in specific cells. "#11 .2441 3" may 
be a factor in providing IMC-HA1 cells with properties such as a rapid invasion of the whole body The above-described 

20 characteristics indicate the possibility of using these proteins for screening compounds having cancer metastasis inhib- 
itory ability. 

[001 3] The protein of this invention may be prepared as a recombinant protein as well as a natural protein using tiie 
genetic recombination technique by methods known to those skilled in the art. The natural protein, for example, can be 
prepared by subjecting extracts of cells or tissues that are expected to express the protein of the present invention to 

25 affinity chromatography using the antibody of the present invention described below. The recombinant protein may be 
prepared by culturing cells transformed with the DNA encoding the protein of this invention as described below. 
[0014] The present invention also relates to a protein functionally equivalent to the above-desaibed protein (mouse 
CMAP, human CMAP, #7. 1 641 1 , and #1 1 .2441 3). For example, a functionally equivalent protein can be isolated by a 
method of inducing the amino acid mutation in a protein. Such a method is known to those skilled in the art. Skilled per- 

30 sons would be able to prepare a mutant protein functionally equivalent to the natural protein by appropriately substitut- 
ing amino acid(s) in the atxwe-described protein (comprising any one of amino acid sequences set forth in SEQ ID 
NOs: 4, 6. 9, and 38) using Kunkel's method (Kunkel, T A. et al.. Methods Enzymol. 154. 367-382 (1987)), the double 
primer metiiod (Zoller. H. J. and Smith, H.. Methods Enzymol. 154. 329-350 (1987)). ttie cassette mutation method 
(Wells, et al., Gene 34. 315-23 (1985)). the megaprimer method (Sarkar, G. and Sommer. S. S., Biotechniques g, 404- 

35 407 (1990)), etc. Amino add mutation of a protein may also occur spontaneously Such a mutant protein derived from 
a natural protein by substituting, deleting, or adding amino acids in its amino acid sequence, and proteins functionally 
equivalent to the natural protein are also included in the protein of the present invention. Here, functionally equivalent" 
means that the protein has cancer metastasis potency. Cancer metastasis potency of the protein can be detected by, 
for example, suppressing the expression or function of said protein in metastatic cancer cells (for example, using the 

40 antisense technique), transplanting said suppressed cells into an animal body, and examining its cancer metastasis 
potency (Cf. Example 9). The extent of amino add mutation in the functionally equivalent protein compared with the nat- 
ural type protein is usually 10% or less of tiie total amino acids, preferably 10 amino adds or less, more preferably 3 
amino adds or less, still preferably 1 amino add. 

[001 5] Another method known in the art for isolating the functionally equivalent prolan is the hybridization technique 
45 (Hanahan. D. and Meselson, H., Method. Enzymol. 100. 333-342 (1983) and Benton. W. D. and Davis. R. W.. Sdence 
196. 1 80-1 82 (1 977)). Those skilled in the art can easily obtain a protein functionally equivalent to the above-described 
protein from DNA that is isolated based on ttie DNA sequences (SEQ ID NOs: 3. 5. 7. 8, and 37) encoding the atxjve- 
desCTibed protein or a portion thereof and highly homologous with said DNA sequences. Such a protein encoded by 
DNA hybridizing with the DNAs encoding the afc>ove-d escribed proteins and functionally equivalent to said proteins is 
so also included in the protein of the present invention. Here, "functionally equivalent" means that the protein has cancer 
metastasis potency similarly as described above. Hybridization for isolating functionally equivalent proteins is usually 
performed in 6 x SSC and 40% formamide at 25°C and subsequent washing with 1 x SSC at 55°C. Preferably, hybrid- 
ization is performed in 6 x SSC and 40% formamide at 37°C and washing with 0.2 x SSC at 55°C. More preferably, 
hybridization is performed in 6 x SSC and 50% formamide at 37°C and washing with 0.1. x SSC at 62°C. Those skilled 
55 in the art can obviously realize such stringent hybridization conditions by appropriately selecting the dilutbn rate of 
SSC, concenti'ation of formamide, temp rature, etc. Animals from which functionally equival nt proteins are isolated 
include human, mouse, rat, rabbit, sheep, cow, dog, and pig, but are not limited thereto. A DNA encoding the protein 
tfius isolated usually has high homology with a nudeotide sequence of cDNA encoding th abov -described protein 
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(nucleotide sequences set forth in SEQ ID NOs: 3, 5, 7, 8. and 37). High homology means a sequence identity of at 
least 70% or more, preferaljly 80% or more, and more preferably 95% or more, at ttie nucleotide level. 
[001 6] In addition, the present invention relates to DNA encoding the above-described protein of tiie present invention 
(mouse CMAP, human CMAP, #7. 1 641 1 , #1 1 .2441 3, and a protein functionally equivalent thereto). There are no limita- 

5 tions in the type of the DNA of this invention as long as it can encode the protein of the invention. The DNA of this inven- 
tion thus includes synthetic DNA as well as cDNA and genoinic DNA. The DNA of this invention can be isolated by 
methods known in the art. For example, cDNA can be screened by labeling cDNA encoding the protein of this invention 
(for example, cDNA comprising any one of nucleotide sequences set forth in SEQ ID NOs: 3, 5, 7, 8, and 37) or a frag- 
ment thereof, RNA complementary to said cDNA, or a synthetic oligonucleotide comprising a portion of the nucleotide 

10 sequence of said cDNA with ''^P, etc., and then hybridizing the labeled product with a cDNA library derived from cells 
or tissues expressing the protein of this invention. A!tsr,",ativG!y, said cDNA 03.1 be cloned by synthesizing oligonucle- 
otides conesponding to the nucleotide sequence of said cDNA, and amplifying them using a polymerase chain reaction 
with cDNA derived from appropriate cells or tissues as a template. Genomic DNA can be screened by hybridizing a 
genomic DNA library with a probe such as cDNA encoding the protein of the present invention (for example, cDNA com- 

15 prising ttie nucleotide sequence of SEQ ID NO: 3, 5, 7, 8, or 37), a portion thereof, complementary RNA, or a synthetic 
oligonucleotide containing a part of the sequence of said DNA, which are labeled with ^^P and the like. Alternatively, 
genomic DNA can be cloned by amplification using polymerase chain reaction witti synthetic oligonucleotide primers 
having a sequence corresponding to the nucleotide sequence of ttie cDNA. Synttietic DNA can be prepared by, for 
example, chemically synthesizing oligonucleotides comprising a partial sequence of cDNA encoding the protein of ttie 

20 present invention (for example, cDNA comprising the nucleotide sequence set forth in SEQ ID NOs: 3, 5, 7, 8, or 37), 
annealing them to form a double sta-and, and ligating them using a DNA ligase (Khorana, H. G. et al. , J. Biol. Chem. 251. 
565-570 (1976); and Goeddel, D. V. et al., Proc. Nati. Acad. Sd. USA 76, 106-10 (1979)). 

[001 7] The DNA thus prepared is useful for producing a recombinant protein. The protein of the present invention can 
be prepared as a recon^inant protein by inserting DNA encoding the atx>ve-described protein of this invention (for 

25 example, cDNA comprising the nucleotide sequence set forth in SEQ ID NOs: 3, 5, 7, 8, or 37) into an appropriate 
expression vector, introducing said vector into appropriate cells, culturing transformants thus obtained, and purifying a 
protein thus expressed. Specifically, when Escherichia coli is a host, a plasmid vector such as pET-3 (Rosenberg, A. H. 
et al.. Gene 5g, 125-35 (1987)), pGEX-1 (Smith, D. B. and Johnson, K. S., Gene gZ. 31-40 1988)), etc. can be used. 
Escherichia coli can be transformed by Hanahan's method (Hanahan, D., J. Mol. Biol. 16g, 557-580 (1983)), ttie elec- 

30 troporation method (Dower, W.J. etal., Nucl. Acids Res. IS, 6127-6145(1988)), etc. A recombinant protein can be syn- 
thesized in the form of a fused protein in which it binds to the tag of histidine residues, glutathione S-fransferase (GST), 
etc. at the N-terminus, etc., and can be purified by binding said fused protein to metal-chelate resin, GST-affinity resin 
(Smith, M. C. et al., J. Biol. Chem. 263, 721 1-7215 (1988)), etc. The desired protein can be developed from the fused 
protein by cleaving the fused protein with thrombin blood coagulating factor Xa, etc. When a fission yeast, Schizosac- 

35 charomyces pombe, is a host, pESP-1 (Lu, Q. et al., Gene 2QQ. 135-144 (1977)) or the like plasmid vector is used. For 
example, yeast is transformed by the spheroplast method (Beach, D. and Nurse, P., Nature 290. 140 (1981)), lithium 
acetate method (OkazakI, K. et al., Nucl. Acids Res. 18, 6485-6489 (1990)), etc. When pESP-1 is used, a recombinant 
protein is syntiiesized as a fusion protein with glutathione S-ti^ansferase (GST) and is purified by binding the fusion pro- 
tein to GST-affinity resin. The protein of interest can be separated from the fused protein by cleaving it witti ttirombin, 

40 Wood coagulating factor Xa, etc. When mammalian cells such as CHO cells derived from Chinese hanreter ovary, 
human HeLa cells, etc. are ttie host, pMSG (CLONTECH) or the like vector is used. The recombinant DNA can be infro- 
duced into mammalian cells by ttie calcium phosphate method (Graham, R L. and van derEb, A. J., Virology 456- 
467 (1973)), DEAE-dextran mettiod (Sussman, D. J. and Milman, G., Mol. Cell. Biol. 4, 1641-1643 (1984)), lipofection 
method (Feigner, R L. et al., Proc. Nati. Acad. Sci. USA 84, 7413-7417 (1987)), electi-oporation method (Neumann, E. 

45 et al., EMBO J. 1, 841 -845 (1982)), etc. When insect cells are ttie host, ttie baculovirus vector pBacPAK8/9 (Clontech) 
or similar vectors are used. For example, insect cells can be transformed according to methods as described in 
Bio/Technology, S, 47-55 (1980). 

[0018] The present invention also relates to DNA specifically hybridizing witti the above-described DNA of ttie present 
invention and having at least 15 nucleotide residues. Here, "specifically hybridize" means ttiat DNA does not cross- 
so hybridize with other DNAs encoding other proteins under sti-ingent conditions. Such DNA can be utilized as a probe for 
detecting and isolating DNAs encoding the protein of this invention, and as a primer for amplifying said DNAs. 
[0019] The present invention also relates to antisense DNA against the DNA encoding the protein of the present 
invention or a portion ttiereof. Such antisense DNA is utilized to suppress the expression of ttie protein of the present 
invention. For example, it is possible to suppress the cancer metastasis ability of said cells by expressing such anti- 
55 sense DNA in cells with cancer metastasis potency to suppress the expression of ttie protein of this invention. In order 
to display th antisense ffect, the antisens DNA is at least 15 bp or more, preferably 100 bp or more, and usually 
shorter than 800 bp, preferably shorter ttian 600 bp. 

[0020] Furthermor , this invention relates to an antibody binding to the protein of the present invention. Antibodies 
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binding to the protein of the present invention can be prepared by methods l<nown in the art (cf. "Shin Seikagaku 
Jil<kenkoza 1 , Proteins I, 389-406, Tokyo Kagaku Dojin"). Polyclonal antitKXiies can be prepared as follows. A suitable 
amount of the above-described protein or peptide is administered to animals to be immunized such as rabbits, guinea 
pigs, mice, or chickens. The antigen may be administered together with an adjuvant (such as FIS and FCA) to stimulate 

5 the antibody production. Immunization is usually performed every several weeks. Repeated immunization allows ele- 
vating the antibody titer. After the final immunization, antiserum can be obtained by collecting the blood from immunized 
animals. Polyclonal antibodies can be prepared by fractionating the antiserum using the ammonium sulfate precipita- 
tion, anion exchange chromatography and affinity chromatography with protein A or the immobilized antigen. Mono- 
clonal antibodies can be prepared as follows. The protein of this invention or a partial peptide thereof is administered to 

10 animals to be immunized as descrit^ed above. After the final immunization, the spleen or lymph node is excised from 
immunized animals. Antibody-producing cells contained in the spleen or lymph node are fused with myeloma cells 
using polyethylene glycol to prepare hybridomas. The hybridomas of interest are then screened and cultured. Mono- 
clonal antibodies can be prepared from the culture supernatant and purified by fractionation using the ammonium sul- 
fate precipitation, anion exchange chromatography and affinity chromatography wHh protein A or the immobilized 

15 antigen. Antibodies thus prepared can be used not only for affinity purification of the protein of this invention but also for 
diagnosis and antibody treatinent of diseases (e.g. cancer) caused by ttie abnormal expression of the protein of ti^is 
invention, and detection of the expression level of the protein. For antitxxjy treatinent, the antibody to be used is pref- 
erat}ly a humanized antibody or human antitxxJy Humanized antibody, for example, a mouse-human chimeric antibody, 
can be prepared by isolating ttie antibody gene from mouse cells producing the antibody against the protein of this 

20 invention, recombining the gene of ttie constant region of H chain with that of ttie constant region of human IgE H chain, 
and inti'oducing the resulting recombinant gene into mouse myeloma J558L cells (Neuberger, N. S. et al.. Nature 314. 
268-270 (1985)). Human antibodies can be prepared by immunizing mice, whose immune system has been replaced 
witti that of humans, with ttie protein of this invention. 

[0021 ] The present invention also relates to a metiiod for screening a compound with cancer metastasis suppressing 
25 potency using the protein of this invention. One embodiment of the screening method according to ttiis invention com- 
prises steps of (a) contacting the protein of this invention whh test samples, and (b) selecting a compound that binds to 
the protein of this invention. Test samples used for the screening include purified proteins, expression products of genes 
(including gene library), extracts of tissues or cells, supernatant of cell cultures, synthetic low molecular weight com- 
pounds, and metabolites of microorganisms, but are not limited to them. Test samples used for the screening may be 
30 appropriately labeled witti a radioactive substance, a fluorescent substance, etc.. but are not limited to them. Binding of 
test samples with the protein of this invention can be detected by ttie label on a compound bound to ttie protein of this 
invention (for example, detecting the binding by the radioactivity or fluorescence intensity). When the TWO-hybrid sys- 
tem is used (Zen/os et al.. Cell 72. 223-232 (1994) and Fritz et al.. Nature 376. 530-533 (1995)). ttie binding can be 
detected by examining the activity of the reporter gene. Besides the above-described TWO-hybrid system, various 
35 methods known to ttiose skilled in the art can be applied to screening compounds binding to the protein of this inven- 
tion. The mettiods include ttie affinity purification method using a column to which the protein of this invention is Immo- 
bilized and various peptide display methods such as the phage display mettiod (F. Parmly and G. P. Smitti, Gene. 73, 
305 (1988)). 

[0022] Another embodiment of the screening mettiod of this invention comprises steps of (a) contacting test samples 

w with cells expressing the protein of this invention, (b) detecting the expression level of the protein of this invention in the 
cells contacted with said test samples using an antibody binding to the protein of this invention, and (c) selecting a com- 
pound ttiat reduces ttie expression level of ttie protein of this invention as compared with ttiat in cells not contacted with 
said test samples. In this screening method, there are no limitations in the type of cells to be treated with test samples, 
as long as they express the protein of this invention. Cells to be treated may thus include IMC-HM cells or cells derived 

45 from them such as M-5076 cells and L5178Y cells. Test samples may include purified proteins, expression products of 
genes (including libraries), extracts of tissues or ceils, supernatants of cell cultures, synthetic compounds of low nralec- 
ular weights, and metabolites of microorganisms but are not limited to these. Many known methods, including the 
ELISA method, immune precipitation mettiod. and Western tilotting method, are available without limitation for detecting 
the expression level of the protein of tills invention using an antibody The antibodies used are labeled for detecting ttie 

50 protein of this invention if required. Examples of labels include enzymes, radioactive substances, and fluorescent sut>- 
stances, but are not so limited. It is also possible to detect ttie protein of this invention without labeling ttie antitxxjy of 
the present invention but by labeling ttie molecule specifically binding to the antibody of this invention such as a sec- 
ondary antibody or protein A. A desired compound can also be screened by detecting ttie expression of mRNA as well 
as that of the protein described above. Tfie expression of mRNA can be detected by for example, reverse b'anscription 

55 PCR (RT-PCR ). G3PDH can be used as a control. Compounds isolated by the above screening methods may be can- 
didates for suppressing tti activity of the protein of this invention. These compounds can be utilized as cancer metas- 
tasis inhibitors. 

[0023] Effects of the compounds isolated by ttie above-described screening methods are preferably confirmed in vivo 
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as occasion demands. One embodiment of the in vivo detection method comprises steps of (a) contacting a compound 
isolated by the above-described saeening method with cells with cancer metastasis potency, (b) transplanting said 
cells to a nonhuman test mammal, and (c) detecting the cancer metastasis in said nonhuman test mammal to deter- 
mine whether said compound suppresses cancer metastasis. When mice are used as the test animal, mouse IMC-HM 
5 cells or those derived from them are preferably used as the cells with cancer metastasis ability. In this case, cells can 
be transplanted to mice sutjcutaneously or through the tail vein. Cancer metastasis can be detected by examining the 
formation of metastasis foci, for exanple. in the liver. 

[0024] Another embodiment of the in vivo detection of cancer metastasis suppressing ability of the compound is the 
method of directly administering the test compound to mice. This method comprises steps of (a) administering a com- 

10 pound isolated by the above-described screening method to nonhuman test mammals, (b) transplanting cells with can- 
cer metastasis potency to said nonhuman test mammals, and (c) detecting cancer metastasis in said nonhuman test 
mammals to determine whether said compound suppresses cancer metastasis. The compourxl can be administered 
orally, intraperitoneaily, through the tail vein, subcutaneously, intracutaneously, or intramuscularly. As in the above- 
desCTibed method, when mice are used as the test animal, cells with cancer metastasis ability are preferably mouse 

IS IMC-HM cells or those derived therefrom. These cells can be transplanted to mice sutxutaneously or through the tail 
vein, etc. Cancer metastasis can be detected by examining the formation of metastasis foci, for example, in the liver. 

Brief Description of the Drawings 

20 [0025] 

Figure 1 is a photograph showing electrophoretic patterns of CMAP expressed in various cancer cells detected by 
Northern blotting. 

Figure 2 is a photograph showing electrophoretic patterns of CMAP expressed in normal tissues detected by 
25 Northern blotting. 

Figure 3 compares amino acid sequences between CMAP and human family-2 cystatin. 

Best Mode for Implementing the Invention 

30 [0026] The present invention will be described in more detail below with reference to examples, but is not to be con- 
strued to be limited to these examples. 

Example 1 Cloning of cancer metastasis-associated gene 

35 (1 ) Extraction of the total RNAs from IMC-HM and IMC-LM cells 

[0027] The total RNAs were extracted from IMC-HM and IMC-LM cells using iSOGEN (Nippon Gene). First, cultured 
cells in the logarithmic growth phase (1x10^ cells) were collected and sedimented by centrifugation. After the culture 
supernatant was removed by aspiration, ISOGEN (1 ml) was added to the cells, and the mixture was allowed to stand 

40 at room temperature for 5 min to lyse the cells. Chloroform (0.2 ml) was then added to the mixture, and the resulting 
mixture was vigorously vortexed for 15 sec, allowed to stand at room temperature for several minutes, and centrifuged 
(4°C) at 12,000 X g for 15 min. The aqueous layer containing RNA was collected, and an equal volume of isopropanol 
was added thereto. The mixture was stirred, allowed to stand at room temperature for 10 min, and centrifuged again 
under cooling at 12,000 x g for 15 min. RNA thus sedimented was collected, rinsed with ethanol, and dissolved in ster- 

45 ilized pure water (50 ^1), which had been treated with diethyl pyrocartxinate (DEPC) to obtain a total RNA solution. 
When cultured cells (5x10^ cells) from which the total RNA was isolated were transplanted subcutaneously into the 
ventral of a five-week old female CDF1 mouse, observation with the naked eye revealed that IMC-HM cells metasta- 
sized to the liver of the host mouse to kill It in two weeks, while IMC-LM cells did not metastasize to the liver, allowing it 
to live for six weeks. 

50 

(2) Preparation of single-stranded cDNA using reverse transcriptase 

[0028] A complementary single-stranded DNA was prepared with the total RNA obtained above as a template using 
reverse transcriptase Superscript II (GIBCO BRL) derived from MMLV and an mRNA Fingerprinting Kit (Clontech). A 
55 cDNA synthesis primer (oligo dT primer) (1 ^l) was added to the total RNA (2 \ig), and the total mixture volume was 
made 5 jj by adding DEPC-treated, sterilized pure water. The mixture was kept at 70°C for 3 min then quickly cooled. 
A mixed solution (total volume 5 \iS) of the reaction buffer (5 x first strand buffer) (2 (J), dNTP mix (each 5 mM) (2 (il). 
and SupersCTipt II (200 unit/pl) (1 mI) was added to this mixture, and the resulting mixture was allowed to stand at 42°C 



7 



EP 0 974 652 A1 



for 1 hour in an air incubator. After the reaction was completed, the mixture was treated at 72°C for 1 0 min, diluted 1 60- 
fold with the sterilized pure watw. and stored at -20 °C . The RNA fingerprinting was then performed based on polymer- 
ase chain reaction (PGR) using this single-stranded cDNA solution. 

(3) Amplification of single-stranded cDNA by PGR 

[0029] Equal amounts of l_A Tag DNA polymerase (Takara Shuzo) (5 U/ ^l) and undiluted solution of TaqStart Ant- 
body (registered trade mark, Clontech) were mixed, and the mixture was allowed to stand at room temperature for a few 
minutes to make a 50 x polymerase mix. This mix (0.4 jil) was combined with 10 x LA PGR buffer (Takara Shuzo) (2 ^l), 
sterilized pure water (13.9 nl), dNTP mix (Glontech) (5 mM each) (0.2 jil), and [^^SJdATPaS (Amersham) (0.5 and 
the resulting mixture was used as a PGR master mix. To the stock solution of the single-stranded cDNA (1 jj) were 
added 20 ^M of P primer (P1-10/SEQ ID NOs: 10 to 19), 20 nM of T primer (T1-9/SEQ ID NOs: 20 to 28) (Glontech) (1 
pi each), and the alxjve-desCTibed PGR master mix (1 7 n1). The resulting mixture was overlaid with mineral oil (Sigma), 
and PGR was performed in a DNA thermal cycler PJ2000 (PERKIN ELN/IER CETUS) to amplify several genes at once. 
The PGR comprises one cyde of 94°G for 5 min, 40°C for 5 min, and 68°G for 5 min; two cycles of 94°G for 2 min, 40°C 
for 5 min, and 68°G for 5 min; 25 cycles of 94°G for 1 min, 60°G for 1 min, and 68°G for 2 min; and finally 68°G for 7 min. 

(4) Separation of PGR products by electrophoresis 

[0030] A plate for the sequence gel (20 x 40 cm) was assembled using a spacer according to the standard method. 
A 6% glycerol-resistant gel and glycerol-resistant buffer (United States Biochemical) were used. PGR products (3.5 jil 
each) amplified with P primer and T primer were mixed with a reaction termination solution (2 nl each), heated at 94°C 
for 2 to 3 min, and then cooled. The resulting mixture (3 n 1 each) was loaded onto the glycerol-resistant gel and elec- 
trophoresed at 35 W for about 3 hours. After electrophoresis, the gel was transferred onto a filter paper, which was dried 
at 80°G for 2 hours in a gel dryer. Radioactivity in the gel was copied on an imaging plate for BAS2000 (Fuji Film) by 
leaving it on said plate overnight, and imaged with BAS2000. 

[0031] This imaging analysis was used to search for genes whose expression level significantly differred between 
IMC-HM cells and IMG-LM cells. If the expression level of one gene is predominantly high in IMC-HM cells, said gene 
can be a candidate of a gene that induces cancer metastasis. Furthermae, if the expression level of one gene is pre- 
dominantly high in IMC-LM cells, said gene would be a candidate of a gene that suppresses cancer metastasis. Two 
bands of possible candidates for the factor inducing metastasis were found from IMG-HM cells. One band was detected 
in PGR products obtained from a primer set of P2 and T3, and the other from a primer set of P2 and T8. The former 
band was designated "23-1," and the latter band, "28-1." 

(5) Cloning of IMG-HM cell-specific cDNA bands 

[0032] IMG-HM cell-specific cDNA bands 23-1 and 28-1 were exdsed directly from the dried gel, and each gel seg- 
ment was boiled in sterilized pure water (40 nl) for 10 min to elute the desired cDNA fragments. This minute quantity of 
cDNA fragment was re-amplified as follows. 

[0033] 10 x EX Tag buffer (Takara Shuzo) (5 (il), dNTP mix (each 2.5 mM) (1 fil), EX Taq DNA polymerase (Takara 
Shuzo) (5 U/fxl) (0.5 and sterilized pure water (31.5 ^1) were mixed to make a PGR master mix. The eluted cDNA 
fragment (7 nl) and the P primer and T primer (2.5 jJ each) which had been used for detecting said cDNA band were 
added to this mix. The mixture was overlaid with mineral oil and subjected to PGR comprising 20 cycles of 94''G for 1 
min, 60°G for 1 min, and 68°C for 2 min in a DNA thermal cyder PJ2000 (PERKIN ELMER GETUS) to specHically 
amplify the desired cDNA fragment. PGR products were electrophoresed on 1% agarose gel using a Mupid (GOSMO- 
BIO) at 50 V for 1 hour and identified by staining with ethidium bromide. 

[0034] In general, PGR products of cDNA have additional A at their 3' -terminus, and may be directly cloned by a TA 
type vector (having T at its terminus). Therefore, cDNA fragments thus re-amplified were cloned using a TA cloning kit 
(Invitrogen) according to the following method. A 10 x ligation buffer (Invitrogen) (1 jd), pGR II vector (registered trade 
mark, Invitrogen) (25 ng/ml) (2 nl), sterilized pure water (9 nl), and T4 DNA ligase (Invitrogen) (1 yd) were added to re- 
amplified PGR fragments (5 nl), and the mixture was allowed to stand at 14°G overnight to insert the cDNA. A tube of 
One Shot INVaF' competent cells (registered ti-ade mark, Invitrogen) was mixed with 0.5 M p-mercaptoethanol (50 jil). 
The above-described vector previously ligated with cDNA (2 ^1) was added to this mixture, arxl the resulting mixture 
was incubated at 42''C for 30 sec. After standing on ice for 2 min, an SOG medium (450 (il) was added, and the mixture 
was shake-cultured at 37°C for 1 hour. The solution obtained after the above transformation procedure (100 fJ and 200 
(xl) was spread onto a previously prepared LB agar plat containing 50 ^g/ml kanamycin and X-Gal and allowed t 
stand at 37°C overnight. Th vector without the insertion reacts with X-Gal to turn blue while the one with cDNA incor- 
porated turns white. According to this principle, 10 white colonies were picked up and incubated again to be used for 
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sequencing; at the same time, replicas were prepared. Plasmids wer prepared from each transformant cell strain using 
a QIAprep Spin Plasmid Kit (QIAGEN) by the follcwing procedure. 

[0035] Each transformant cell strain was incutjated in an LB medium (3 ml) containing 50 ng/ml kanamycin at 37°C 
for about 8 hours and centrifuged to collect the cells. These cells wer dissolved in buffer PI (QIAGEN) (250 ^1). mixed 

5 with buffer P2 (QIAGEN) (250 further mixed with buffer P3 (QIAGEN) (350 nl). and then centrifuged at 10,000 x g 
for 10 min. TTie supernatant was transferred onto a QIAprep Spin column (QIAGEN) and centrifuged at 10,000 x g for 
1 min. The column was washed with buffer PB (QIAGEN) to remove the nuclease activity and further washed with the 
buffer PE (QIAGEN) (0.75 ml). The plasmid retained in the column was then eluted with the TE kxjffer and collected. 
The concentration of the plasmid was calculated from the A260 value determined with a Hitachi spectrophotometer (U- 

w 3300). 

(6) Sequencing of the cloned IMC-HM cell-specific cDNA fragment 

[0036] A nucleotide sequence of cDNA was determined by the dye terminator method (equipment used, ABI Model 
15 377). Although performance conditions may vary depending on the purification grade and concentration of plasmid 
serving as a template, in general the sequence consisting of 500 to 600 nucleotides may be determined at the same 
time using the dye terminator method. pCR II vector has the forward and reverse primer sites of Ml 3 near the insertion 
site. Using each primer previously fluorescence-labeled, both strands were compared to determine the respective 
sequences. AmpliTaq (25 nl), 5 x buffer (25 ^l), and purified water (125 nl) were mixed to dilute the enzyme, and premix 
20 was prepared for each of A, C, G, and T. To prepare premix of A or C, equal volumes of d/ddNTP, dye-primer, 5 x buffer, 
and the diluted enzyme (25 jil each) were mixed; the premix of G or T was prepared by mixing 50 nl each of these ingre- 
dients. The template DNA (100 to 200 ng/jil) (1 jj) and the premixes of A and C (4 jil each), and said template DNA (2 
nl) and the premixes of G and T (8 nl each) were separately mixed and subjected to cycle sequencing reaction with 25 
cycles of oe'C for 2 min, 96°C for 10 sec, 50°C for 5 sec, and 60°C for 4 min using a PERKIN ELMER 9600. Four clones 
25 randomly selected from 10 each cloned plasmids were sequenced, and all of them were found to have identical nucle- 
otide sequences. One of the plasmid clones obtained from 23-1 was then designated 23-1#2, and that obtained from 
28-1 was designated 28-1 #3. 

[0037] Next, sequences determined for 23-1 #2 (SEQ ID NO: 1) and 28-1 #3 (SEQ ID NO: 2) were analyzed with a 
computer using software from Genetics Computer Group, Wisconsin Package version 8.1. GenBank and EMBL data- 
30 base searches for the sequence homology with a FastA revealed that these sequences were entirely novel and had 
never been reported. 

Example 2 Establishing clonal lines reflecting properties of IMC-HM cells and IMC-LM cells 

35 [0038] Clonal lines of IMC-HM cells and IMC-LM cells were established by Uie limiting dilution culture method. Cells 
of each strain were disti-ibuted on a 96-well plate to 4 cells/10 wells, and 10 clones that had each proliferated from a 
single cell were randomly selected. 

[0039] In order to detect the expression of 23-1 #2 and 28-1#3 in each of these clones, RT-PCR was performed as 
follows. The total RNA and \he reverse transcriptase were prepared similarly to the akxjve-described methods ((1) and 
40 (2) in Example 1). In order to anplify the specific cDNA by PCR, single-st-anded oligo DNA primers, "23-1 #2F1" 
(CAGAATCTGCTCATGCAGTC (SEQ ID NO: 29)), "23-1#2Rr (CACTCCTTACTTTCCACCCC (SEQ ID NO: 30)), "28- 
1#3Fr (AACAGCATTTCCCTAAAGCTCGG (SEQ ID NO: 31)). and "28-1#3Rr (TGGAAACTACTTCCCTGCTCCCA 
(SEQ ID NO: 32)) were designed. 

[0040] Portions of the sequences of 23-1 #2 and 28-1 #3 can be specifically amplified using a set of 23-1 #2F1 and 23- 
45 1#2R1 andanottiersetof28-1#3F1 and 28-1#3R1. Thirty cycles of PCR were performed using single-stranded cDNAs 
prepared from cells derived from IMC-HM and IMC-LM cells as a template, and EX Tag DNA polymerase under condi- 
tions similar to tiiose in Example 1 (5). As a result, it was observed that both 23-1#2 and 28-1#3 were amplified as 
cDNAs of a similar appropriate size in the IMC-HM-derived clonal cells, while no signals were detected in the IMC-LM- 
derived clonal cells. 

50 [0041] Next, in order to confirm the metastasis potency of each of the clonal cells in vivo, 5x10'' cells were subcu- 
taneously transplanted to CDF1 mice. All clonal cells derived from IMC-HM killed the host two to three weeks after the 
transplantation, arxl their remarkable metastasis was observed with the naked eye in the liver and spleen while no 
metastasis foci were observed at all with the cells derived from IMC-LM. In addition, no distinct difference was detected 
between the cells derived from IMC-LM and those derived from IMC-HM, even by the observation of morphology pro- 

55 deration ability in vitro, etc. One clonal cell strain each was selected from ttie IMC-HM- and IMC-LM-derived clonal cells 
and designated "IMC-HAI" and "IMC-LE5," respectively. The IMC-HA1 c II was deposited witti the following depositary 
institution: 
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(a) Name of the depositary institution and its address: 

Name: National Institute of Bioscience and Human-Technology Agency of Industrial Science and Technology 
Address: 1-3, Higashi 1-chome Tsukuba-shi Ibaraki-ken 305-0046 Japan 

5 

(b) Date of deposition (Date of original deposition): April 1 , 1997 

(c) Accession Number: PERM BP-6242 

Example 3 Analysis of clones 23-1#2 and 28-1#3 in IMC-HA1 and IMC-LE5 cells by Northern blot 

10 

[0042] In order to examine whether the expression of clones 23-1 #2 and 28-1 #3 differs at the poly(A) RNA leveL 
Northern t)lot analysis was performed. Poly(A) RNA was prepared from IMC-HA1 and IMC-LE5 cells using a Messen- 
ger RNA Isolation Kit (STRATAGENE) as follows. 

[0043] About 10° cells were recovered from a culture suspension (100 ml) and dissolved in a denaturing solution 

15 (STRATAGENE) (5 ml) containing f> -mercaptoethanol (50 n I)- An eluton buffer (STRATAGENE) (10 ml) was added to 
this mixture, and the resulting mixture was centrifuged at 12,000 x g for 10 min. The supernatant was mixed with oligo- 
dT cellulose (0.04 g/ml) (5 ml) and gently stirred at room temperature for 15 min. After the mixture was centrifuged at 
700 x g for 3 min. the supernatant was removed, and the cellulose was resuspended in a high-salt buffer (STRATA- 
GENE) (5 ml). This washing process was repeated twice, and then the cellulose residue was suspended in a low-salt 

20 buffer (STRATAGENE) (5 ml). The cellulose suspension was then packed into a 2.5-ml push column (STRATAGENE), 
and the remaining poly(A) RNA was eluted with an elution buffer (400 ^1) at 68°C. 3 M sodium acetate (40 ^1) and 100% 
ethanol (1.1 ml) were added to the eluate, and the mixture was centrifuged at 15,000 x g. Precipitates thus otitained 
were rinsed with 80% ethanol and dissolved in a TE buffer containing no RNase (40 ^1) to make a poly(A) RNA solution. 
Concentration was determined by measuring the absorbence (A260). 

25 [0044] Next, for Northern blot analysis, 23-1 #2 and 28-1 #3 were directly radio-labelled with [a-^^P]dCTP using a Meg- 
aprime DNA labelling system (Amersham) according to the standard method. The primer (5 fd) was added to cDNA 
(100 ng) to make the total volume 33 ^l, and the mixture was heated at 95°C for 5 min and allowed to stand at room 
temperature. A labelling buffer (Amersham) (10 [a-^^P]dCTP (5 ^l), and Klenow DNA polymerase I (1 unit/ jil) (2 (J) 
were added to ttiis mixture , and the mixture was incubated at 37 °C for 10 min. To terminate the reaction, 0.5 M EDTA 

30 (3.5 ul) was added to this mixture , and the labelled probe and unreacted [a-^^P]dCTP were separated using a Sepha- 
dex G-50 spin column. Eluates from the column were diluted witin purified water, and radioactivity of the probe was 
counted with a liquid scintillation counter. 

[0045] Northern blot analysis was then performed. Poly(A) RNA (2 p g) was loaded on a 1% agarose gel prepared in 
1 X MOPS buffer and electrophoresed at 50 V for 1 .5 hour. After the electrophoresis, the gel was left standing overnight 
35 to transfer the nucleic add from the gel onto a Hylxnd N+ membrane (Amersham) via 10 x SSC Ixiffer. After the trans- 
fer, the membrane was air-dried, and the nucleic acid on the membrane was fixed by UV with a STRATALINKER. After 
the UV treatment, hybridization was carried out as follows. 

[0046] The membrane was soaked in a prehybridization solution (containing 5 x Denhardt's solution, 50% formamide, 
2% SDS, and denatured salmon sperm DNA (100 ^g/ml)), and prehybridization was performed at 42°C for 1 hour, [a- 
40 ^^PjdCTP-labelled 23-1#2 and 28-1#3 were then added to tiie above-described prehybridization solution as the probe, 
and said membrane was allowed to stand at 42°C overnight. After the hytxidization, the memljrane was washed once 
with 2 X SSC in 0.1% SDS solution at 50°C for 30 min and twice with 0.5 x SSC in 0.1% SDS solution at 50''C for 30 
min. The radioactivity was imaged according to the above-described method. 23-1#2 and 28-1#3 exhibited bands of 
about 1 .1 kb and 0.7 kb, respectively, which hybridized only witii the bands from IMC-HA1 Cells. 

45 

Example 4 Analysis of dones 23-1#2 and 28-1#3 in IMC-HA1 and IMC-LE5 cells by Southern blot 

[0047] Genomic DNA was prepared from IMC-HA1 and IMC-LE5 cells using a DNA Extraction Kit (STRATAGENE) 
and digested witti EcoRI. The digested products were loaded (10 ^g each) onto a 1% agarose gel prepared in a TAE 
50 buffer, separated by electi-ophoresis at 50 V, ti-ansferred to Hybond N+, and subjected to Southern blot analysis. Both 
cDNA probes exhibited identical band patterns. Therefore, it was indicated that both 23-1 #2 and 28-1 #3 were present 
in the genome of both IMC-HA1 and IMC-LE5 cells but expressed more predominantiy in IMC-HA1 cells. 

Example 5 Cloning of full-length cDNAs of clones 23-1 #2 and 28-1 #3 

55 

(1) Preparation of cDNA library derived from IMC-HA1 cells 

[0048] A cDNA library derived from IMC-HA1 cells was prepared using a ZAP Express cDNA Synthesis Kit (STRAT- 
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AGENE) by the following method. 

(a) Preparation of the total RNA and poly(A) RNA 

10049] The total RNA was extracted from IMC-HA1 cells by the, method described In Example 1 (1). In this case, 
extraction with ISOGEN was repeated twice to reduce the contamination of proteins, lipids, DNA fragments, etc. From 
this total RNA, poly(A) RNA was extracted as follows. A spin column was prepared by placing a Whatmann 3M filter 
paper on the bottom of a 2.5-ml syringe and filling it with 5 ml of ollgo(dT)-cellulose (0.04 g/ml) (STRATAGENE). The 
column was loaded with a sample, allowed to stand at room temperature for 30 min, then washed three times with a 
high-salt buffer and a low-salt buffer (Clontech). After elution with an elution buffer (Clontech), a 1/5 volume of a sample 
buffer was added to the s!uate,the resulting mixture was loadeded orrto a new column, and the same procedure was 
repeated. After precipitation with ethanol and rinse, the product was dissolved in a TE buffer to make a poly(A) RNA 
solution. 

(b) Synthesis of the first strand cDNA 

[0050] Poly(A) RNA (5 ng) and Xhol linker primer (170 pmol) were mixed, treated at 70°C for 5 min, then cooled in 
ice. 5 x first strand buffer (10 jJ), 0.1 M dithiothrettol (GIBCO BRL) (5 ^1), dNTP mix (10 mM each) (3 jj), and Rnasein 
(40 U/jd) (STRATAGENE) (1 nl) were added to this mixture , and reverse transcriptase Superscript II (GIBCO BRL) 
(12,000 U) was added to make a final volume of 50 jd. The resulting mixture was incubated at 44°C for 1 hour. 

(c) Synthesis of the second strand cDNA 

[0051] A 10 x second strand buffer (20 ^l), second strand dNTP mix (STRATAGENE) (6 jil), sterilized pure water 
(113.9 ^l), and [a -^^PjdATP (Amersham) (2 id) were added to the first strand cDNA synthesized above (45 ^ll), and E. 
coli RNase H (3 U) and E. coli DNA polymerase (STRATAGENE) (100 U) were added. The mixture was incubated at 
16°C for 2.5 hours to synthesize the second strand. 

(d) Insertion of cDNA into the ZAP Express Vector 

[0052] The free terminus of the Xhol linker primer was bJunted with Pfu DNA polymerase (STRATAGENE) (5 U). 
ligated to the EcoRI adaptor with T4 DNA ligase (4 U) by allowing the mixture to stand at 8''C overnight, and further 
phosphorylated at the EcoRI tenninus with T4 polynucleotide kinase (10 U). The Xhol linker terminus was cleaved by 
Xhol (120 U). cDNA was inserted into the ZAP Express Vector between the EcoRI and Xhol sites by reacting with the 
T4 DNA ligase at 12°C overnight 

(e) In vitro packaging and titer assay 

[0053] cDNA was packaged into phage using a Gigapack Gold II (STRATAGENE) exactly according to the process 
recommended by the supplier's protocol. The titer of the phage solution after packaging was assayed as follows. The 
original phage solution and its 10-fold dilution with an SM buffer were prepared separately, and 1 each of both solu- 
tions was incubated with 200 jJ of XL1 -Blue MRF (ODgoo = 0.5) at i7°C for 1 5 min to infect the host bacteria. Incubated 
mixtures were overlaid on a 100 mm NZY agar (GIBCO) plate together with 3 ml of an LB top agar. 0.5 M IPTG, and 
1 2.5 mg of X-Gal, arxl allowed to stand overnight. Ninety-five percent or more of plaques were found to retain the insert, 
and the volume of phage solution required for obtaining 50,000 plagues was determined from the number of plaques 
formed. 

(f) Amplification of cDNA library 

[0054] Appropriate amounts of phage-infected host bacteria were overlaid on 20 previously prepared 150 mm NZY 
agar plates together with the top agar so that 50,000 plaques/plate could be obtained. These plates were then incu- 
bated at 37°C overnight to form plaques. After 8 ml of SM buffer was added, the plates were allowed to stand at 4''C 
overnight. The SM buffer in which phage was sufficiently released was recovered, and the plates were further rinsed 
with 2 ml of the same buffer to recover the phage. This was distributed in aliquots as the cDNA library stock solution 
and stored frozen at -70''C. This stock solution was used for screening the cDNA library . 
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(2) Screening of cDNA library derived from IMC-HA1 cells 

(a) Primary screening 

[0055] Twenty NZY agar plates were prepared in sterilized square #2 petri dishes. XL1 -Blue MRF" adjusted to ODeoo 
= 0.5 was infected witti the cDNA library phage so as to obtain 50,000 plaquesykJiate. overlaid on the above NZY agar 
plates together with an LB top agar, and allowed to stand at 37°C overnight. The next day, after suitable plaque forma- 
tion was confirmed, the plates were cooled at 4-0 for 2 hours, and DNA in plaques was transferred to a Hytxjnd N+ 
membrane. The membrane was denatured, dried, UV cross-linked, then subjected to hybridization by the following pro- 
cedure. 

[0056] The membrane was soaked in a solution containing a 2 x PIPES buffer, 50% formamlde. 0.5% -SDS. a.nd dena- 
tured salmon sperm DNA (100 \i g/ml) and prehybridized at 42 °C for 1 hour. [ a - ^^PJdCTP -labeled 23-1 #2 and 28-1 #3 
were prepared according to the above-desaibed method and added as probes to the prehybridization solution. The 
mixture was allowed to stand at 42°C overnight. After hybridization, the membrane was washed three to four times with 
a solution containing 0.1 x SSC and 0.1% SDS at 65°C for 20 min. The radioactivity was imaged according to the 
above-described method. Images were printed out in the actual size of the plate using a BAS 2000. Points correspond- 
ing to positive plaques on the plate were punched out with a sterilized and blunted tip of a P1000, and the phage was 
eluted into an SM buffer. 

(b) Secondary screening 

[0057] Phage solutions that had been judged positive in the primary screening were further screened by a procedure 
similar to that described above. In this case, a serial 10-fold dilution of the phage solution was prepared and overlaid on 
100 mm plastic dishes in place of sterilized square #2 petri dishes. The phages localized at the point judged positive in 
the secondary screening were eluted into the SM buffer, and the DNA insert was amplified using a set of T7 and T3 
primers. At this stage, when many inserts were detected, a similar screening process was continued. 
[0058] This saeening of the cDNA library yielded a phage clone #9.3133 that specifically binds to 23-1 #2, and clones 
#7.1641, #8.32, and #11.2441 that specifically bind to 28-1 #3 were obtained. 

(c) Cloning into pBK-CMV phagemid vector 

[0059] It is possible to directly excise out the ZAP Express vector in vivo as the pBK-CMV phagemid vector containing 
inserts in the presence of a helper phage (ExAssist helper phage). The pBK-CMV phagemid vector was excised out 
from the above-described phage clones, introduced into XLOLR cells, and allowed to form colonies on kanamydn-con- 
taining LB plates for cloning. Phagemid vector clones #9.31334, #7.1641 1, #8.323, and #1 1 .24413 were thus otrtained. 

(d) Sequencing of clones #9.31334, #7. 1 6411 , #8.323, and #1 1 .2441 3 

[0060] Partial sequencing of the clones was performed by the dye primer method using the pBK-CMV phagemid vec- 
tor containing these cDNAs. Specifically; sequencing was performed similarly as described in Example 1 (6), except 
that the cycle sequencing reaction was canied out using T3 primer (AATTAACCCTCACTAAAGGG/SEQ ID NO: 33) or 
T7 primer (GTAATACGACTCACTATAGGGC/SEQ ID NO: 34). Furthermore, a primer to be used in RT-PCR and an 
appropriate primer were designed based on the sequence of the insert in the pBK-CMV phagemid vector whose cDNA 
sequence was partially determined by the dye primer method. The undetermined region was then sequenced by the 
dye terminator method using a PRISM Ready Reaction Terminator Cycle Sequencing Kit. Double-stranded DNA (250 
to 500 ng). primer (3.2 pmol). and the atove-described premix (8 nl) were mixed in a final volume of 20 (il. and the cycle 
sequencing was performed by the reaction of 96°C for 2 min, and 25 cycles of 96°C for 10 sec, SCO for 5 sec, and eo^C 
for 4 min. Electrophoresis and analysis were performed under conditions similar to those in the dye primer method. 
[0061] As a result, clone #9.31334 having the whole length of 983 bp (SEQ ID NO: 3) was otitained for 23-1#2. This 
sequence was found to be a novel gene comprising the region encoding 167 amino acids. This amino acid sequence 
exhibited atxjut 38% homology to the bovine cystatin B, an inhibitory protein for cathepsins. Furthermore, three cDNA 
clones (#7.16411, #8.323. and #11.24413) were obtained for 28-1#3. Comparative studies on these nucleotide 
sequences revealed strong indications that #8.323 can be processed differently to produce #7.1641 1 and #1 1 .24413. 
Clone #8.323 was 1,114 bp while clones #7.16411 and #11.24413 were 691 bp and 684 bp, respectively Clones 
#7.1641 1 and #11 .2441 3 represented an identical nucleotide sequence except for atwut 100 bp on the 5'-terminus side 
and were considered to encode the proteins of 131 and 126 amino acids with a differenc of 20 to 30 amino adds at 
the N-terminus. From the deduced amino add sequences, #7.1641 1 was considered to have a hydrophobic N-termi- 
nus. while #11.24413 was considered to be hydrophilic. A homology search indicated that three kinds of sequences 
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otjtained using 28-1#3 also represented novel genes similar to #9.31334. Furthermore, a protein presumed to be 
encoded by #7.1641 1 exhibited nearly 60% homology to human SLPI (secretoary leul<oprotease inhibitor). Nucleotide 
sequences of these cDNA clones are shown in SEQ ID NO: 5 (nucleotide sequence of the clone #7.1641 1), SEQ ID 
NO: 7 (nucleotide sequence of the clone #8.323), and SEQ ID NO: 8 (nucleotide sequence of the clone #1 1 .24413). 

Example 6 Expression of #7. 1 641 1 and #1 1 .2441 3 in various cancer cells and normal tissues 

[0062] Primers which specifically amplify #7.1641 1 and #11.24413 were designed, and the expression of these 
clones in various cancer cells and normal tissues was examined by RT-PGR. PGR was performed under the conditions 
in which each cDNA can be quantitatively detected in IIWC-HA1 cells (conditions described below^ in Example 7). As set 
of primers. "7F2" (SEQ ID NO: 35) and "28-1#3F1" (SEQ ID NO: 31) were used to amplify #7.1641 1 . and "1 1 F2" (SEQ 
iD NO: 49) and "28-l#3Fr (SEQ ID NO: 31). to amplify #1 1.24413. As a result, it was found that #7.1641 1 was able to 
be amplified in almost all cancer cells and normal tissues examined, though the gradation of developed color differed 
somewhat. In contrast. #1 1 .24413 could be detected only in cancer cells such as the P388 and MC-HM cells, and in 
normal tissues such as the thymus, lung, and spleen, and slightly in peripheral blood. From these results, it is highly 
possible that the processing to produce #7.16411 is common in the majority of normal tissues, while #11.24413 is 
expressed only in particular cells. #1 1 .24413 was considered to be an important factor to impart to the IMC-HA1 cells 
characteristics such as the ability to rapidly invade the whole body 

Example 7 Expression of mouse GMAP in various cancer cell lines and normal tissues 

[0063] The amino acid sequence deduced from the coding region of #9.31334 is designated mouse CMAP and is so 
called below. 

[0064] Expression of mouse CMAP was examined in 12 different mouse cancer cell lines by reverse transcription 
PGR (RT-PCR)(ngure 1). Total RNA was extracted from each cell line by a method similar to that in Example 1 (1), and 
a complementary single-stranded cDNA was prepared similarly to that in Example 1 (2). Mouse GMAP was then 
detected by PGR similarly as in Example 1 (5). The reaction was comprised of 94''G for 2 min; 30 cydes of 94°G for 1 
min and 68°C for 2 min; and GB'C for 5 min, using a set of primers "23-1#2Fr (SEQ ID NO: 29) and ''23-1#2Rr (SEQ 
ID NO: 30). Preliminary examination of the dilution of a single-stranded cDNA serving as a template confirmed that sub- 
quantitative conditions producing a nearly linear relationship between the dilution and product yield were established 
for IMC-HA1 cells when CMAP was diluted 10-fold for the anrplification and the control G3PDH was diluted 40-fold for 
the detection. 

[0065] Mouse GMAP was thus clearly expressed in M-5076 cells, LSI 78Y cells, P388 cells, and LI 21 0 cells in addi- 
tion to IMG-HA1 cells. Anatomical observation with the naked eye and pathological analysis confirmed that all of these 
cell lines formed metastatic fod mainly in the liver when 1 x 1 0^ cells were transplanted through the tail vein of mice. B- 
16-BL6 cells and Colon 26 cells in which almost no expression of mouse CMAP could be confirmed strongly induced 
an experimental metastatic foci in the lung, but not in the liver. Furthermore, all other cell lines wherein the expression 
of mouse GMAP was below the detection limit were almost rejected and regressed after the b'ansplantation through the 
tail vein of mice. These results indicated that the expression of mouse CMAP could be dosely associated with the 
metastasis of cancer cells to the liver. 

[0066] In addition, 15 tissues were surgically excised from two to three 5-week-old normal female CDF1 mice and 
disrupted using a Polytron homogenizer. The expression of mouse GMAP was confirmed by RT-PGR similarly as 
described above (Figure 2). Mouse GMAP was found to be expressed in the thymus, spleen, and mesenteric lymph 
nodes, although the expression was dearly weaker than for 1MG-HA1 cells. These results indicated that mouse GMAP 
could also physiotogically function in normal tissues, profoundly associating with the immune mechanism. 

Example 8 Identification of human homologue of mouse GMAP 

(1) Database Search 

[0067] The present Inventors searched for a human nucleotide sequence showing homology to that of mouse GMAP 
in the Genbank and EMBL databases with a Fast A using VMS (Version 8) software from Genetics Computer Group. 
The results revealed that two Expressed Sequence Tags (ESTs), N47763 and N56875, showed 75.7% and 72.0% 
homologies to mouse CMAP, respectively These two ESTs hold a homologous sequence comprising 34 nucleotide res- 
idues, strongly indicating that they might be derived from human GMAP of the same origin. 
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(2) Amplification of human CMfiiP by RT-PCR 

[0068] Since mouse CMAP was expressed in the spleen among normal tissues of mice, the human spleen was cho- 
sen as the supply source of template DNA. A single-stranded cDNA was prepared from the total RNA (OriGene) (2 ^g) 
5 extracted from the human spleen tiy a method similar to that in Example 1 (2). Based on sequences of N47763 and 
N56875, eight primers (SEQ ID NOs: H-CHAP-1 to 8) were designed. PGR was performed on the template single- 
stranded cDNA using two primers including H-CMAP-1 and H-CMAP-2 similarly as in Example 1 (5). Reaction was car- 
ried out in the sequence 94°C for 2 min; 30 cycles of 94°C for 30 sec. 60°C for 30 sec, and 72°C for 90 sec; and 72°C 
for 5 min. The thermal cycler used was a PERKIN EU^ER 9600. 

10 

(3) Sequencing 

[0069] The nucleotide sequence of the cDNA product of about 0.8 kb (SEQ ID NO: 37) derived from ttie amplified 
human CMAP was directly sequenced according to a method similar to that in Exanple 1 (6) using primers H-CMAP-1 
IS to 8 (SEQ ID NOs: 39 to 46). Human CMAP exhibited a 65.9% homology to mouse CMAP In addition, human CMAP 
was located in the nucleotide residues from 83 to 583, and its amino acid sequence (SEQ ID NO: 38) possessed a 
71.6% homology to ttiat of nrouse CMAP. Therefore, it was sti-ongly indicated that these two proteins have approxi- 
mately the same function. 

20 (4) Comparison of amino acid sequences of human CMAP with family 2 cystatin 

[0070] Amino acid sequences of mouse and human CMAPs and human family 2 cystatin were compared by a multi- 
sequence comparison program, MegAlign, in the Lasergene (DNASTAR) (Figure 3). Consensus sequences of human 
family 2 cystatin (cystein residues at. positions 121, 132, 146, and 166; glycine residue at 59; QXVXG at 103 to 107; 

25 and VFW at 153 to 155) were found to be minimally conserved in CMAPs. However, tiie overall homology was 30% or 
less in all CMAPs. In particular, sequences at the N-terminus were characteristic and long, and found to conprise many 
cysteine residues which may impart specificity to CMAP. Theoretically, it is highly possible that N of position 84 is glyc- 
osylated, a characteristic not observed in other sequences in its vicinity Although CMAPs hold sequences resembling 
those hitheto reported for cystatins, they were found to comprise many characteristic amino acid residues, indicating 

30 the possibility of having properties differing from tiiose of known substances. 

Example 9 Decrease in cancerous metastasis potency of IMC-HA1 cells in mice due to the constitutive decrease of 
mouse CMAP 

35 [0071] It is generally agreed tinat alterations of the nature, quality or expression level of multiple overlapped genes are 
required for metastasis. Therefore, it was thought to be difficult to induce a single metastasis-associated gene in cells 
witti low metastasis potency in mouse to confirm tiie elevation of their metastasis ability. IMC-HA1 cells were therefore 
transformed with a vector in which a sequence complementary to the mouse CMAP mRNA was inserted to consti-uct 
stably b'ansformed clonal cells, which were examined to determine the change in metastasis potency in mice. 

40 

(1) Construction of pBK-AS-CMAP vector 

[0072] The pBK-CMV expression vector was treated witti restriction enzymes Nhel and Kpnl to prepare a linear vector 
whose lacZ promoter and multicloning sites were removed. The insert was prepared from the total mRNA of IMC-HA1 

4S cells by RT-PCR as the starling material. CMAP-F1 (SEQ ID NO:. 47) and CHAP-R1 (SEQ ID NO: 48) were used as a 
set of primers. The Nhel and Kpnl restriction enzyme recognition sites are introduced in these primers at their termini, 
and their amplification product corresponds to the entire noncoding and coding regions on the 5'-slde of mouse CMAP. 
The amplification product of about 0.5 kb was digested with resb-iction enzymes Nhel and Kpnl, purified with a QIAquick 
PCR Purification Kit (QIAGEN), and ligated to the above-prepared linear vector using a DNA Ligation Kit (Takara 

50 Shuzo). In tills way, the pBK-AS-CMAP vector that expresses mRNA acting as the antisense against mouse CMAP in 
cells was constructed. 

(2) Transfer of pBK-AS-CMAP vector into IMC-HA1 cells and selection of transformants 

55 [0073] After the pBK-AS-CMAP vector was introduced into E. coli competent cells and cultured on a large scale, plas- 
mids were prepared in larg amounts according to ttie standard method using a QIAfilter Plasmid Midi Kit (QIAGEN). 
The plasmid (about 50 fi g) was cleaved at one site with ApaLI to make it linear, purified by phenol exti-action and eth- 
anol precipitation, and dissolved in EP buffer (50 ;il) (containing 25 mM Hepes (pH 7.0), 137 mM NaCI, 5 mM KCI, 
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0.27mM Na2HP04, and 6mM dextrose). IMC-HA1 cells (about 1x10^ cells) at the logarithmic growth phase were 
recovered from the culture medium (100 ml) and resuspended in EP buffer to prepare a cell suspension of 4 x 10^ 
cells/ml. This cell suspension (450 ^i) was mixed with the plasmid solution prepared above (50 ^ 1 ), and plasmids were 
transfected into cells by electroporation under the conditions of 250 mV, 500 nF. and a single pulse using a GenePulser 

5 II (Bio-Rad). After the mixture was allowed to stand at room temperature for 10 min, cells were cultured under the usual 
conditions. After 24-hour cuHuring, transformed cells were continuously treated with neomycin (0.3 mg/ml) for 2 weeks 
to seject statile transformants. The cells were cloned by the limiting dilution method to obtain about 50 donal cells. The 
cells in wtiich the expression level of mouse CMAP was decreased were screened by RT-PCR. As a result, ttiree lines 
of cell clones, 57C4, 53A9, and 53E9, were obtained. Similar procedures were also performed with pBK-CMV vector 

10 with no insert, and an 1 885 cell line was obtained as the vector control cell clone. When the expression of mouse CMAP 
mRNA in these cell lines was confirmed by Northern blot analyses, said mRNA was expressed in the 18B5 cell line at 
almost the same level as that in the IMC-HA1 cell line, but only about 49%, 32%, and 16% in 57C4, 53A9, and 53E9 
cell lines. No particular changes in the proliferation and morphology during culturing were noticed in any of these cell 
lines. 

15 

(3) Confirmation of metastasis potency in vivo 

[0074] Metastasis abilities of cloned cell lines derived from IMC-HA1 cells with different expression patterns of mouse 
CMAP mRNA were tested in mice by three independent experiment systems. In experiment I, each cell line (5x10'' 

20 cells) was subcutaneously ti-ansplanted to mice. The liver was excised on tiie 15th day after the ti-ansplantation and 
subjected to observation of appearance, weight deternination, and calculation of the survival rate. Experiment 11 was 
performed in the same matter as experiment I except ttiat on the third day after the ti-ansplantation primary tumors at 
the ti-ansplantation site were excised together with proximal lymph nodes. In this experiment, a model of a minute met- 
astatic focus of IMC-HA1 cells in the liver can be prepared. Furthermore, in experiment III, cells (1 x 10^ cells) were 

25 direcfly transferred through the tail vein, and the liver was excised on the 13th day after ti-ansplantation. At the same 
time, a diachronic study of the survival was performed. The experiment revealed that the expression of mouse CMAP 
mRNA was closely associated with ttie formation of liver metastatic focus and survival of mice (Table 1). Note 1) in the 
table represents "Number of mice which survived for 60 days/Number of mice tested." 

30 
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Table 1 

Experiraent I 
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Snrvival days 


■•an 
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55 [0075] It is especially noteworthy that in experiment 1 1, one out of six mice in which 53A9 cells were transplanted and 
three (halO of six mice in which 53E9 cells were transplanted survived for 60 days or more without metastasis to the 
liver. These results suggested that the expression of mouse CMAP is profoundly associated with the formation of met- 
astatic focus in the liver, and that the gene acts mainly after cancer cells enter the vessel. 
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Industrial Applicabilitv 

[0076] Hie present invention provides a novel protein associated with cancer metastasis and a DNA encoding said 
protein. This invention also provides a vector carrying said DNA, a transformant harboring said vector, and a method 
for preparing a recombinant protein by cuHuring said transformant. In addition, this invention provides DNA to be used 
to detect, isolate, amplify, or suppress the expression of said gene. Furthermore, this invention provides a method for 
screening cancer metastasis inhibitors utilizing said protein, enabling development of novel cancer metastasis inhibi- 
tors. 
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Sequence Listing 

(1) Name of Applicant: Banzu Pharnaceutical Co., Ltd. 

(2) Title of the Inveation: Cancer Metastasis-associated Gene 
03) Reference Nuaber: Bl-OOIPCT 

(4) Application Nuaber: 

(5) filing Date: 

(6) Country where the priority application vas filed and the application ntmber 
of the applicaticm: Japan, No. Hei 9-105333 

(7) Priority date: April 8, 1997 

(8) Nuaber of Sequences: 49 

SEQ ID NO: 1 
SEQUENCE LENGTH: 656 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
HYPOTHETICAL: Ho 
ANTISENSE: Yes 
ORIGINAL SOURCE: 

ORGANiai: Nus Musculus 

CELL LINE: IMC-HAl cell 

SEQUENCE DESCRIPTION: SEQ ID NO: 1 

TATTATGHG AGTGATATTT TTTTTTTTAG GTGATCAGCT TTTTATrTAT GATAAATCAT 60 

CCCCATCAAT AHAGTCTGA TGGTAGCAGA CCAGACGGCT GCAGAATCTG CTCATGCAGT 120 

CGGGCATCAA CAAGCnCCC ACACTACCAC CHGnGACT GTATCTGCTT TGCTCTGCAC 180 

GTGCTCCAGT AATGTCATCA AAGGACTCTC TGCGTGCCCA CATCCTGAAT CTAGGTGTGT 240 
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AGCACCAGAG AATTCArTTT AGAAGAAAGT GTGAATACAG HAATCTCGA GAGGAGGGCA 300 

CTATATTCTC CCAGTCCAGC CCCCTGATCA CCCACTG6CA TGTTAATCTG CTCGCTGGTA 360 

AGCAGGCCCT GAGGATCTGA GATGGCCAG6 ATGGGGTG6A AAGTAAGGAG TGGGGAAAAC 420 

ACGAGTGGAT ATGTATCACA GCTGCAGTCT TGCTAAAGAG ACAGAAGTCA CTGGCAGAGG 480 

AGAACAGGCA CCTCAAAACT GTGGAGCCAG GGGATGACCC AGACHCAGA GTAGCAATAT 540 

10 AGAGTCCGCT TCAAGGCAGG GnGGTTTGG AAGTCACAGT TGTCCAGTTG GTGATGCATG 600 

GTCTTCCTGC ATGTAGTTCG GCCGAHTTC ACCTCCAGCA TTTAGTGAGG GT7AAT 656 
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SEQ ID NO: 2 
SEQUENCE LENGTH: 369 
SEQUENCE TYPE: nucleic acid 
STEANDEDNESS: double 
TOPOLOGY: linear 
■ MOLECULE TYPE: cDNA 
HYPOTHETICAL: No 
ANT 1 SENSE: Yes 
^5 ORIGINAL SOURCE: 

ORGANISM: Mus Musculus 
CELL LINE: IMC-HAl cell 

SEQUENCE DESCRIPTION: SEQ ID NO: 2 

CATTATGCTG AGTGATATCT TTTTTTTTGC ATAGAGAAAT GAATGCGTH ATTTATTTGC 60 

TCTCCAACAG CATTTCCCTA AAGCTCCGTG AATGCTGAGC CAAAAGGAGA TGHAGIGGT 120 

35 AAAACAGTAA CCCTGGTCCA CAGGATCCAA.GCCTGGGATA CAGGGAGCCG GGAGCCATGG 180 

AACCCGGAGC AGGGACGCCT GGGACCAGCG AAGTAGTCTC CAGAGCACAC CGAGCACGAG 240 

TCCAGAGCCG GCGCCAATGT CAGGGATCAG GCTCACATCG GGGGCAGGCA GACnTCCCA 300 

CATATACCCT CACAGCACTT GTATTTGCCG TCACACTCCC CGTCCCTCCA GCATTTAGTG 360 

^ AGGGHAAT 369 

SEQ ID NO: 3 
^ SEQUENCE LENGTH: 983 

SEQUENCE TYPE: nucleic acid 

STEANDEDNESS: double 

TOPOLOGY : linear 
5" MOLECULE TYPE: cDNA to sRNA 

HYPOTHETICAL: No " 



55 



19 



EP 0 974 652 A1 



AUTISENSE: No 

ORIGINAL SOUECE: 
^ ORGANISM: Mus Musculus 

CELL LINE: IMC-HAl cell 

IMMEDIATE SOURCE 
,^ LIBRARY : IMC-HAl cell-derived cDNA library 

CLONE: 23-1 #2 

FEATURE: 

NAME/KEY : CDS 
'5 LOCATION : 17.. 520 

IDENTIFICATION METHOD: E 

SEQUENCE DESCRIPTION: SEQ ID NO: 3 

20 

CTGAAGCTAC CCCACC ATG CCC TGG TCC TGG AGC TGT ACT TGC CGA GCA CTT 52 
Met Pro Trp Ser Trp Ser Cys Thr Cys Arg Ala Leu 
1 5 10 

25 GGC CCA CTA CAT GCT CCC TGC AH TCC CCA GCC ATG TGG CTG GCC AH 100 

Gly Pro Leu His Ala Pro Cys He Ser Pro Ala Met Trp Leu Ala He 

15 20 25 

CTG CTT GCC CTC TGC TGC CTA ACT TCT GAC ACC CAT GGG GCA CGC CCC 148 
3" Leu Leu Ala Leu Cys Cys Leu Thr Ser Asp Thr His Gly Ala Arg Pro 

30 35 40 

CCA GAT Tn TGT TCC AAA GAT TTG ATC TCC ACT GTG AAG CCA GGA TTC 196 
Pro Asp Phe Cys Ser Lys Asp Leu He Ser Ser Val Lys Pro Gly Phe 

35 

45 50 55 60 

CCC AAA ACA ATA GAG ACC AAT AAC CCA GGA GTG CTT AAG GCC GCC AGG 244 
Pro Lys Thr He GIu Thr Asn Asn Pro Gly Val Leu Lys Ala Ala Arg 
40 65 70 75 

CAC AGT GTG GAA AAG TTC AAC AAC TGC ACA AAT GAC ATC Tn HG TO 292 
His Ser Val Glu Lys Phe Asn Asn Cys Thr Asn Asp He Phe Leu Phe 

80 85 90 

AAG GAG TCC CAT GTC AGC AAA GCC CTG 6TA CAG GTG GTG AAA GGC CTG 340 
Lys Glu Ser His Val Ser Lys Ala Leu Val Gin Val Val Lys Gly Leu 
95 100 105 

50 AAA TAT ATG CTC GAG GTG AAA ATC GGC CGA ACT ACA TGC AGG AAG ACC 388 

Lys Tyr Met Leu Glu Val Lys He Gly Arg Thr Thr Cys Arg Lys Thr- 
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10 



15 



20 



35 



40 



45 



110 115 120 

ATG CAT CAC CAA CTG GAC AAC TGT GAC TTC CAA ACC AAC CCT GCC TTG 436 
Met His His Gin Leu Asp Asn Cys Asp Phe Gin Thr Asn Pro Ala Leu 
125 130 135 - 140 

AAG CGG ACT CTA TAT TGC TAG TCT 6AA GTC TGC 6TC ATC GCC TGG CTG 484 
Lys Arg Thr Leu Tyr Cys Tyr Ser Glu Val Trp Val He Pro Trp Leu 

145 150 155 

CAC AGT TIT GAG GIG CCT GTT CTC CTC TGC CAG TGACTTCTGT CTCTTTAGCA 537 
His Ser Phe Glu Val Pro Val Leu Leu Cys Gin 

160 165 
AGACTGCAGC TGTGATACAT ATCCACTCGT GTTTTCCCCA CTCCTTACTT TCCACCCCAT 597 
CCTGGCCATC TCAGATCCTC AGGGCCTGCT TACCAGCGA6 CAGAHAACA TGCGAGTGGG 657 
TCATCAGGGG GCTGGACTGG GAGAATATAG TGCCCTCCTC TCCAGAHAA CTGTAHCAC 717 
ACHTCnCT AAAATGAATT CTCTGGTGCT AGACACCTAG ATOAGGATG TCGCCACGCA 777 
GAGAGTCCTT TGATGACAH ACTGGAGCAC GTGCAGAGCA AAGCAGATAC AGTCAACAAG 837 
GTGGTAGTGT GGGAAGCTTC HGATGCCCG ACTGCATGAG CACAnCTGC AGCCGTCTGG 897 
TATGCTACCA TCAGACTAAT AHGATGGGG ATGATTTATC ATAAATAAAA AGCTGATCAC 957 
CTAAAAGCAA AAAAAAAAAA AAAAAA 983 

-SEQ ID NO: 4 
SEQUENCE LENGTH: 167 
SEQUENCE TYPE: aaino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
ORIGINAL SOURCE: 

ORGANISM: Mus Musculus 

CELL LINE: IMC-HAl cell 



SEQUENCE DESCRIPTION: SEQ ID NO: 4 

Met Pro Trp Ser Trp Ser Cys Thr Cys Arg Ala Leu 
1 5 10 

Gly Pro Leu His Ala Pro Cys He Ser Pro Ala Met Trp Leu Ala He 

15 . 20 25 

Leu Leu Ala Leu Cys Cys Leu Thr Ser Asp Thp His Gly Ala Arg Pro 
50 30 35 40 

Pro Asp Phe Cys Ser Lys Asp Leu He Ser Ser Val Lys Pro Gly Phe 
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45 50 55 60 

Pro Lys Thr lie Clu Thr Asn Asn Pro Gly Val Leu Lys Ala Ala Arg 

65 70 75 

His Ser Val Glu Lys Phe Asn Asn Cys Thr Asn Asp He Phe Leu Phe 

80 85 90 

Lys Glu Ser His Val Ser Lys Ala Leu Val Gin Val Val Lys Gly Leu 

AC i AA • nir 

SO J.UU IVO 

Lys Tyr Met Leu Glu Val Lys He Gly Arg Thr Thr Cys Arg Lys Thr 

110 115 120 

Met His His Gin Leu Asp Asn Cys Asp Phe Gin Thr Asn Pro Ala Leu 
125 130 135 140 

Lys Arg Thr Leu Tyr Cys Tyr Ser Glu Val Trp Val He Pro Trp Leu 

145 150 155 

His Ser Phe Glu Val Pro Val Leu Leu Cys Gin 
160 165 



SEQ ID NO: 5 
SEQUENCE LENGTH: 691 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to bENA 
HYPOTHETICAL: No 
ANTI SENSE: No 
ORIGINAL SOURCE: 

ORGANISM: Mus Musculus 

CELL LINE: IMC-HAl cell 
IMMEDIATE SOURCE: 

LIBRARY: IMC-HAl cell-derived cDNA library 

CLONE: 28-1 #3 
FEATURE: 
NAME/KEY: CDS 
LOCATION: 37.. 429 
IDENTIFICATION METHOD: E 

SEQUENCE DESCRIPTION: SEQ ID NO: 5 
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10 



20 



CTCGGGACTG GTCATCAGAG CTCCCCTGCC nCACC ATG AAG TCC TGC GGC CTT 54 

Met Lys Ser Cys Gly Leu 
1 5 

m CCT TTC ACQ GTG CTC CTT GCT CTG GCG ATC CTG GCA CCC TG6 ACT 102 
Leu Pro Phe Thr Val Leu Leu Ala Leu Gly He Leu Ala Pro Trp Thr 

10 15 20 

GTG GAA GCA GG€ MA AAT GAT GCT ATC AJLA ATC GGA C^CC TC-C GCT C-CT 150 
Val Glu Gly Gly Lys Asn Asp Ala lie Lys He Gly Ala Cys Pro Ala 

25 30 35 

AAA AAG CCT GCC CAG TGC CTT AAG CTI GAG AAG CCA CAA TGC CGT ACT 198 
Lys Lys Pro Ala Gin Cys Leu Lys Leu Glu Lys Pro Gin Cys Arg Thr 

40 45 50 

GAC TGG GAG TGC c'CG GGA AAG CAG AGG TGC TGC CAA GAT CCT. TGC GGT 246 
Asp Trp Glu Cys Pro Gly Lys Gin Arg Cys Cys Gin Asp Ala Cys Gly 
55 60 65 70 

TCC AAG TCC GTG AAT CCT GTT CCC ATT CGC AAA CCA GTG TGG AGG AAG 294 
25 Ser Lys Cys Val Asn Pro Val Pro He Arg Lys Pro Val Trp Arg Lys 

75 80 85 

CCT GGG AGG TGC GTC AAA ACT CAG GCA AGA TGT ATG ATG CTT AAC CCT 342 
,.Pro Gly Arg Cys Val Lys Thr Gin Ala Arg Cys Met Met Leu Asn Pro 
90 95 100 

CCC AAT GTC TGC CAG AGG GAC GGG CAG TGT GAC GGC AAA TAC AAG TGC 390 
Pro Asn Val Cys Gin Arg Asp Gly Gin Cys Asp Gly Lys Tyr Lys Cys 

105 110 115 

TGT GAG GCT ATA TGT GGG AAA GTC TGC CTG CCC CCG ATG TGAGCCTGAT . 439 
Cys Glu Gly He Cys Gly Lys Val Cys Leu Pro Pro Met 

120 125 130 

CCCTGACAH CGCGCCGGCT CTGGACTCGT GCTCGGTGTG CTCTGGAAAC TACTTCCCTG 499 
CTCCCA6GCG TCCCTGCTCC GGGTTCCATG GCTCCCGGCT CCCT6TATCC CAGGCTTGCA 559 
TCCTGTGGAC CAGGGTTACT GTTTTACCAC TAACATCTCC nnGGCTCA GCAHCACCG 619 
AGCTTTAGGG AAATGCTGH GGAGAGCAAA TAAATAAACG CATTCATnC TCTAAAAAAA 679 
AAAAAAAAAA AA 691 



30 



35 



40 



45 



SEQ ID NO: 6 
SEQUENCE LENGTH: 131 
SEQUENCE TYPE: anino acid 
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TOPOLOGY: linear 
MOLECULE TYPE: protein 
ORIGINAL SOURCE: 

ORGANISM: Mus Musculus 

CELL LINE: IMC-HAl cell 
SEQUENCE DESCRIPTION: SEQ ID NO: 6 

Met Lys Ssr Cys Gly Leu 
1 5 
Leu Pro Phe Thr Val Leu Leu Ala Leu Gly He Leu Ala Pro Trp Thr 

10 15 20 

Val Glu Gly Gly Lys Asn Asp Ala He Lys He Gly Ala Cys Pro Ala 

25 30 35 

Lys Lys Pro Ala Gin Cys Leu Lys Leu Glu Lys Pro Gin Cys Arg Thr 

40 45 50 

Asp Trp Glu Cys Pro Gly Lys Gin Arg Cys Cys Gin Asp Ala Cys Gly 
55 60 65 70 

Ser Lys Cys Val Asn Pro Val Pro He Arg Lys Pro Val Trp Arg Lys 

75 80 85 

Pro Gly Arg Cys Val Lys Thr Gin Ala Arg Cys Met Met Leu Asn Pro 

90 95 100 

Pro Asn Val Cys Gin Arg Asp Gly Gin Cys Asp Gly Lys Tyr Lys Cys 

105 110 115 

Cys Glu Gly He Cys Gly Lys Val Cys Leu Pro Pro Met 
120 125 130 

SEQ ID NO: 7 
SEQUENCE LENGTH: 1114 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
HYPOTHETICAL: No 
ANTISENSE: No 
ORIGINAL SOURCE: 

ORGANISM: Mus Musculus 

CELL LINE: IHC-HAl cell 
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IMMEDIATE SOURCE: 

LIBRARY: IMC-HAl cell-derived cDNA library 
* CLONE: #8.323 

SEQUENCE DESCRIPTION: SEQ ID NO: 7 

,0 CCAGAGGCCA GTTT6GGATG CCAAACCCCT ACCTAACCAG AAGAAGAGAA GAAAGGCCAC 60 

TGCCGAGGTC ACTTCCAGTA CTTGGAGGA6 AAAGC.AACGT TCCCATTTAC AGCTGAGTAA 120 

CAGGAGCCAC AAGGTATGTC T6ACTCAAAA GTTCAGGaC TCGATGACTG TGCGGTGCTG 180 

CCCAGTGTGT CTTCTTCAAT G7AACCTCAG GACCTAGAAC AGCACCnGC ATGTGCTCTC 240 

AGGTGGmC TCTGATGGCC TCATGGTCCT GCCTGAAACA GAAAGTCTGC CACCTACTTC 300 

TGTAGCAGCA AGACTCCTGT TCTGTGGCTA AGCTTCCTGC CTGTGCAAGA GCCACAGGGA 360 

GGGGCCAAAT GCATGCCACT GGGGCCACGC CTCCTGGTAA AGACATAAAT AGTGATCCTC 420 

20 GGGACTGGTC ATCAGAGCTC CCCTGCCTO ACCATGAAGT CCTGCGGCCT TTTACCTTTC 480 

ACGGTGCTCC TTGCTCTGGG GATCCTGGCA CCCTGGACTG TGGAAGGAGG CAAAAATGAT 540 

GCTATCAAAA TCGGAGCCTC CCCTGCTAAA AAGCCTGCCC AGTGCCTTAA GCTTGAGAAG 600 

CCACAATGCC GTACTGACTG GGAGTGCCCG GGAAAGCACA GGTGCTGCCA AGATGCTTGC 660 

^5 GGnCCAAGT GCGTGAATCC TGTTCCCATT CGCAAACCAG TGTGGAGGAA GCCTGGGAGG 720 

TGCGTCAAAA CTCAGGCAAG ATGTATGATG CnAACCCTC CCAATGTCTG CCAGAGGGAC 780 

GGGCAGTGT6 ACGGCAAATA CAAGTGCTGT 6AGGGTATAT GTGGGAAAGT CTGCCTGCCC 840 

CCGATGTGAG CCTGATCCCT GACATTGGCG CCGGCTCTGG ACTCGTGCTC GGTGTGCTCT 900 

30 

GGAAACTACT TCCCTGCTCC CAGGCGTCCC TGCTCCCGGT TCCATGGCTC CCGCCTCCCT 960 

GTATCCCAGG CnCGATCCT GTGGACCAGG GTTACTGTTT TACCACTAAC ATCTCCTTTT 1020 

GGCTCAGCAT TCACCGAGCT HAGGGAAAT GCTGnOGAG AGCAAATAAA TAAACGCAn 1080 

35 CATTTCTCTA TGCAAAAAAA AAAAAAAAAA AAAA 1114 

SEQ ID NO: 8 
SEQUENCE LENGIH: 684 

40 

SEQUENCE TYPE: nucleic acid 
STBANDEDNESS: double 
TOPOLOGY: linear 
« MOLECULE TYPE: cDNA to aRNA 

HYPOTHETICAL: No 
ANTI SENSE: No 
ORIGINAL SOURCE: 

50 

ORGANISM: Mus Musculus 
CELL LINE: IMC-HAl cell 
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ORIGINAL SOUECE: 

LIBBARY: IMC-HAl cell-derived cDNA library 

CLONE: 28-1 #3 

FEATURE: 

NAME/KEY: CDS 

LOCATION: 40.. 420 
inpMTmpjTinw uvTunn. e 

lA/blv 1 11 1 1 VII IMJlllWi/. u 

SEQUENCE DESCRIPTION: SEQ ID NO: 8 

TGmnAAT GATCTGGAAG CACCAGAGGC CAGTnOGG ATG CCA AAC CCC TAG CTA 57 

Met Pro Asn Pro Tyr Leu 
1 5 

ACC AGA AGA AGA GAA GAA AGG CCA CTG CCG AGG TCA CH CCA GTA CT! 105 
Thr Arg Arg Arg GIu GIu Arg Pro Leu Pro Arg Ser Leu Pro Val Leu 

10 15 20 

GGA GAT GCT ATC AAA ATC GGA GCC TGC CCT GCT AAA AAG CCT GCC CAG 153 
Gly Asp Ala He Lys He Gly Ala Cys Pro Ala Lys Lys Pro Ala 61n 

25 30 35 

TGC CTT AAG CTT GAG AAG CCA CAA TGC CGT ACT GAC TGG GAG TGC CCG 201 
Cys Leu Lys Leu GIu Lys Pro Gin Cys Arg Thr Asp Trp GIu Cys Pro 

40 45 50 

GGA AAG CAG AGG TGC TGC CAA GAT GCT TGC GGT TCC AAG TGC GTG AAT 249 
Gly Lys Gin Arg Cys Cys Gin Asp Ala Cys Gly Ser Lys Cys Val Asn 
55 60 65 70 

CCT GTT CCC ATI CGC AAA CCA GTG TGG AGG AAG CCT GGG AGG TGC GTC 297 
Pro Val Pro He Arg Lys Pro Val Trp Arg Lys Pro Gly Arg Cys Val 

75 80 85 

AAA ACT CAG GCA AGA TGT ATG ATG CTT AAC CCT CCC AAT GTC TGC CAG 345 
Lys Thr Gin Ala Arg Cys Met Met Leu Asn Pro Pro Asn Val Cys Gin 

90 95 100 

AGG GAC GGG CAG TGT GAC GGC AAA TAC AAG TGC TGT GAG GGT ATA TGT 393 
Arg Asp Gly Gin Cys Asp Gly Lys Tyr Lys Cys Cys GIu Gly He Cys 

105 110 115 

GGG AAA GTC TGC CTG CCC CCG ATG TGAGCCTGAT CCCTGACAH GGCGCCGGCT 447 
Gly Lys Val Cys Leu Pro Pro Met 
120 125 



26 



EP 0 974 652 A1 



CTGGACTCGT GCTCGGTGTG CTCTGGAAAC 
GGGnCCATG GCTCCCGfiCT CCCTGTATCC 
GTmACCAC TAACATCTCC TnTGGCTCA 
GGAGAGCAAA TAAATAAACG CATTCATTTC 

'0 SEQ ID NO: 9 

SEQUENCE LENGTH: iE6 
SEQUENCE TYPE: aaino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
ORIGINAL SOURCE: 

ORGANISM: Mus Musculus 

20 CELL LINE: IMC-HAl cell 



TACTTCCCTG CTCCCAGGCG TCCCTGCTCC 507 

CAGGCTTGGA TCCTGTG6AC CAGGGTTACT 567 

GCATTCACCG AGCTTTAGGG AAATGCTGH 627 

TCTATGCAAA AAAAAAAAAA AAAAAAA 684 



SEQUENCE DESCRIPTION: SEQ ID NO: 9 

Met Pro Asn Pro Tyr Leu 
1 5 
Thr Arg Arg Arg GIu GIu Arg Pro Leu Pro Arg Ser Leu Pro Val Leu 

10 15 20 

Gly Asp Ala He Lys lie Gly Ala Cys Pro Ala Lys Lys Pro Ala Gin 

25 30 35 

Cys Leu Lys Leu GIu Lys Pro Gin Cys Arg Thr Asp Trp GIu Cys Pro 

40 45 50 

Gly Lys Gin Arg Cys Cys Gin Asp Ala Cys Gly Ser Lys Cys Val Asn 
55 60 65 70 

Pro Val Pro lie Arg Lys Pro Val Trp Arg Lys Pro Gly Arg Cys Val 

75 80 85 

Lys Thr Gin Ala Arg Cys Met Met Leu Asn Pro Pro Asn Val Cys Gin 

90 95 100 

Arg Asp Gly Gin Cys Asp Gly Lys Tyr Lys Cys Cys GIu Gly lie Cys 

105 110 115 

Gly Lys Val Cys Leu Pro Pro Met 
120 . 125 

SEQ ID NO: 10 
SEQUENCE LENGTH: 25 
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SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 10 

ATTAACCCTC ACTAAATGCT GGGBA 

SEQ ID NO: 11 

SEQUENCE LENGTH: 25 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 11 

ATTAACCCTC ACTAAATGCT GGAGG 

SEQ ID NO: 12 

SEQUENCE LENGTH: 25 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 12 

ATTAACCCTC ACTAAATGCT GGTGG 

SEQ ID NO: 13 

SEQUENCE LENGTH: 25 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 13 

ATTAACCCTC ACTAAATGCT GGTAG 

SEQ ID NO: 14 
SEQUENCE LENGTH: 26 
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SEQUENCE TYPE: nucleic acid 

STBANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 14 

ATTAACCCTC ACTAAAGATC TGACTC 

SEQ ID NO: 15 

SEQUENCE LENGTH: 25 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 15 

AHAACCCTC ACTAAATGCT GGGTG 

SEQ ID NO: 16 

SEQUENCE LENGTH: 25 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 16 

ATTAACCCTC ACTAAATGCT GTATG 

SEQ ID NO: 17 

SEQUENCE LENGTH: 25 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 17 

ATTAACCCTC ACTAAATGGA GCTGG 

SEQ ID NO: 18 
SEQUENCE LENGTH: 25 
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SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 18 

AnAACCCTC ACTAAATGTG GCAGG 

SEQ ID NO: 19 

SEQUENCE LENGTH: 25 

SEQUENCE TYPE: nucleic acid 

STEANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 19 

AHAACCCTC ACTAAAGCAC CGTCC 

SEQ ID NO: 20 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STEANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 20 

CATTATGCTG AGTGATATCT TTTTTTTTAA 

SEQ ID NO: 21 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STEANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCEIPTION: SEQ ID NO: 21 

CAmTGCTG AGTGATATCT TTTTTTTTAC 

SEQ ID NO: 22 
SEQUENCE LENGTH: 30 
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STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 22 

CAHATGCTG AGTGATATCT TTinnTAG 



SEQUENCE LENGIH: 30 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 23 

CATTATGCTC AGTGATATCT TTTTTITrCA 

SEQ ID NO: 24 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 24 

CAnATGCTG AGTGATATCT TTTTTTTTCC 

SEQ ID NO: 25 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 25 

CATTATGCTC AGTGATATCT TTTTTTTTCG 

SEQ ID NO: 26 
SEQUENCE LENGTH: 30 
STRANDEDNESS: single 
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TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
SEQUENCE DESCRIPTION: SEQ ID NO: 26 
CAHATGCTG AGTGATATCT TTTTTTrTGA 

SEQ ID NO: 27 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STBANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 27 

CATTATGCTG AGTGATATCT TTTTTTTTGC 

SEQ ID NO: 28 

SEQUENCE LENGTH: 30 

SEQUENCE TYPE: nucleic acid 

STBANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 28 

CATTATGCTG AGTGATATCT TTTTnnGG 

SEQ ID NO: 29 

SEQUENCE LENGTH: 20 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 29 

CAGAATCT6C TCATGCAGTC 

SEQ ID NO: 30 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
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TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
SEQUENCE DESCRIPTION: SEQ ID NO: 30 
CACTCCTTAC mCCACCCC 

SEQ ID NO: 31 

SVOIIVMPP TPMRTTI. 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 31 

AACAGCATTT CCCTAAAGCT CGG 

SEQ ID NO: 32 

SEQUENCE LENGTH: 23 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 32 

TGGAAACTAC nCCCTGCTC CCA 

SEQ ID NO: 33 

SEQUENCE LENGTH: 20 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 33 

AAnAACCCT CACTAAA6GG 

SEQ ID NO: 34 
SEQUENCE LENGTH: 22 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
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TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
SEQUENCE DESCRIPTION: SEQ ID NO: 34 
GTAATACGAC TCACTATAGG GC 

SEQ ID NO: 35 

SEQUENCE LtnGTH: 26 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

7F2 

SEQUENCE DESCRIPTION: SEQ ID NO: 35 
CTCGGGACTG GTCATCAGAG CTCCCC 

SEQ ID NO: 36 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 37 

GACTGTATCT GCTTTGCTCT GCAC 

SEQ ID NO: 37 

SEQUENCE LENGTH: 800 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULE TYPE: cDNA to aENA 

HYPOTHETICAL: 

ANTISENSE: No 

ORIGINAL SOURCE: 

ORGANISM: Mas Muscukus 
CELL LINE: IMC-HAl cell 

FEATURE: 
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NAME/KEY: CDS 
LOCATION: 83.. 583 
* IDENTIFICATION METHOD : E 

SEQUENCE DESCRIPTION: SEQ ID NO: 37 
10 ACAGACCACT GCCCCCACCT GCCCT6CGCC ATCTACCCAA GAAGGCTCGG CACGGGCACC 60 

AACCACTGCC TCCAACTGCC CC ATG CTG CCT GAG AAG GCA CTG CAC GGC CAC 112 

Met Leu Pro Glu Lys Ala Leu His Gly His 
1 5 10 

CCC CAA CTG CCC CGC ACT GTC CCT ACC CGG GCA GCC ATG CGA GCG GCT 160 
Pro Gin Leu Pro Arg Thr Val Pro Thr Arg Ala Ala Met Arg Ala Ala 
15 20 25 

20 GGA ACT CTG CTG GCC TTC TGC TGC CTG GTC TTG AGC ACC ACT GGG GGC 208 

Gly Thr Leu Leu Ala Phe Cys Cys Leu Val Leu Ser Thr Thr Gly Gly 

30 35 40 

CCT TCC CCA GAT ACT TGT TCC CAG GAC CH AAC TCA CGT GTG AAG CCA 256 
Pro Ser Pro Asp Thr Cys Ser Gin Asp Leu Asn Ser Arg Val Lys Pro 

45 50 55 

CGA TTT CCT AAA ACA ATA AAG ACC AAT GAC CCA GGA GTC CTC CAA GCA 304 
Gly Phe Pro Lys Thr He Lys Thr Asn Asp Pro Gly Val Leu Gin Ala 

60 65 70 

GCC'A'GA TAC ACT GTT GAA AAG TIC AAC AAC TGC ACG AAC GAC ATG TIC 352 
Ala Arg Tyr Ser Val Glu Lys Phe Asn Asn Cys Thr Asn Asp Met Phe 
35 75 80 85 90 

TTG TTC AAG GAG TCC CGC ATC ACA AGG GCC CTA GTT CAG ATA GTG AAA 400 
Leu Phe Lys Glu Ser Arg lie Thr Arg Ala Leu Val Gin lie Val Lys 
95 100 105 

40 

GGC CTG AAA TAT ATG CTC GAG CTG GAA AH GGC AGA ACT ACC TGC AAG 448 
Gly Leu Lys Thr Met Leu Glu Val Glu He Gly Arg Thr Thr Cys Lys 
110 115 120 

45 AAA AAC CAG CAC CTG CGT CTG GAT GAC TGT GAC TO CAA ACC AAC CAC 496 

Lys Asn Gin His Leu Arg Leu Asp Asp Cys Asp Phe Gin Thr Asn His 

125 130 135 

ACC TTG AAG CAG ACT CTG AGC TGC TAC TCT GAA GTC TGG GTC GTG CCC 544 

50 

Thr Leu Lys Gin Thr Leu Ser Cys Tyr Ser Glu Val Trp Val Val Pro 
140 145 150 
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TGG CTC CAG CAC TO GAG GTG CCT GTT CTC C6T TGT CAC TGACCCCCGC 593 
Trp Leu Gin His Phe Glu Val Pro Val Leu Arg Cys His 
155 160 165 

CTCTTCAGCA AGACCACAGC CATGACAAAC ACCAGGATGC ATGCTCCTT6 TCCCCTCCCA 653 

CCCGCCTCAT GACCCAGCCT CACAGACCCT CTCAGGCCTC TGACGA6TGA GCGGGTGAAG 713 

TGCCACTGGG TCACCGCAGG GCAGCTGGAA TGGCAGCATG GTA6CGCCTC CTAACAGATT 773 

AAATAGATCA CATTTGCTO TAAAAH 800 



15 



20 



25 



30 



40 



SO 



SEQ ID NO: 38 

SEQUENCE LENGTH: 167 

SEQUENCE TYPE: aaino acid 

TOPOLOGY: linear 

MOLECULE TYPE: protein 

ORIGINAL SOURCE: 

ORGANISM: Mus Muscukus 
CELL LINE: IMC-HAl cell 



SEQUENCE DESCRIPTION: SEQ ID NO: 38 

Met Leu Pro Glu Lys Ala Leu His Gly His 
1 5 10 

Pro (3ln Leu Pro Arg Thr Val Pro Thr Arg Ala Ala Met Arg Ala Ala 

15 20 25 

Gly Thr Leu Leu Ala Phe Cys Cys Leu Val Leu Ser Thr Thr Gly Gly 
35 30 35 40 

Pro Ser Pro Asp Thr Cys Ser Gin Asp Leu Asn Ser Arg Val Lys Pro 

45 50 55 

Gly Phe Pro Lys Thr He Lys Thr Asn Asp Pro Gly Val Leu Gin Ala 

60 65 70 

Ala Arg Tyr Ser Val Glu Lys Phe Asn Asn Cys Thr Asn Asp Met Phe 
75 80 85 90 

« Leu Phe Lys Glu Ser Arg He Thr Arg Ala Leu Val Gin lie Val Lys 

95 100 105 

Gly Leu Lys Thr Met Leu Glu Val Glu lie Gly Arg Thr Thr Cys Lys 

110 115 120 

Lys Asn Gin His Leu Arg Leu Asp Asp Cys Asp Phe Gin Thr Asn His 
125 130 135 
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Thr Leu Lys Gin Thr Leu Ser Cys Tyr Ser Glu Val Trp Val Val Pro 

140 145 150 

Trp Leu Gin His Phe Glu Val Pro Val Leu Arg Cys His 
155 160 165 

SEQ ID NO: 39 

SEQUENCE LENGTH: 21 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 39 

ACAGACACTG CCCCCACCTG C 

SEQ ID NO: 40 

SEQUENCE LENGTH: 22 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 40 

AATTHAGAA GCAAATGTGA TC 

SEQ ID NO: 41 

SEQUENCE LENGTH: 23 

SEQUENCE nPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 41 

ACATGTCGTT CGTGCAGnO TTG 

SEQ ID NO: 42 
SEQUENCE LENGTH: 22 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
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TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
SEQUENCE DESCRIPTION: SEQ ID NO: 42 
AAnGGCAGA ACTACCTGCA AG 

SEQ ID NO: 43 

SEQUENCE LENGTH: 22 

SEQUENCE TYPE: nucleic acid 

STEANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 43 

GGTCnGCTG AAGAGGCGGG GG 

SEQ ID NO: 44 

SEQUENCE LENGTH: 22 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 44 

CACTGTCCCT ACCCGGGCAG CC 

SEQ ID NO: 45 

SEQUENCE LENGTH: 23 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 45 

ATCTACCCAA GAA6GCTCGG CAC 

SEQ ID NO: 46 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
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TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 46 

TCTGTTAGCA GGCGCTAGGA TGC 23 

SEQ ID NO: 47 

SEQUENCE LENGIB: 35 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 47 

AATTCGGTAC CAGCTGAAGC TACCCCACCA TGCCC 35 

SEQ ID NO: 48 

SEQUENCE LENGTH: 34 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 48 

AGTCGCTAGC AGAGGAGAAC AGGCACCTCA AAAC 34 

SEQ ID NO: 49 

SEQUENCE LENGTH: 24 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 49 

TGTTATTAAT GATCTGGAAG CACC 24 



Claims 



A protein comprising any one of the amino acid sequences set forth in SEQ ID NOs: 4, 6, 9. or 38, or a protein com- 
prising any one of said amino acid sequences having substitution, deletion, or addition of one or more amino acids 
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and having cancer metastasis potency. 

2. A protein encoded by DNA which hytxidizes with DNA comprising any one of the nucleotide sequences set forth in 
SEQ ID NOs: 3, 5, 7, 8, or 37, said protein having cancer metastasis potency. 

3. A DNA encoding the protein according to claim 1 . 

4. The DNA according to claim 3, comprising any one of the nucleotide sequences set forth in SEQ ID NOs: 3, 5, 7, 
8, or 37. 

5. A DNA hybridizing with a DNA comprising any one of the nucleotide sequences set forth in SEQ ID NOs: 3, 5, 7, 
8, or 37. encoding a protein having cancer metastasis potency. 

6. A vector can-ying the DNA according to any one of claims 3 to 5. 

7. A Iransformant hartioring the vector according to claim 6. 

8. A method for preparing the protein according to claim 1 or 2, comprising culturing the transformant set forth in daim 
7. 

9. A DNA specifically hybridizing with the DNA according to any one of claims 3 to 5. comprising at least 1 5 nucleotide 
residues. 

1 0. An antisense DNA against the DNA according to any one of claims 3 to 5 or a portion thereof. 

11. An antibody binding to the protein according to claim 1 or 2. 

12. A method for screening a compound having cancer metastasis inhibitory ability, comprising steps of: 

(a) contacting a test sample with the protein according to claim 1 or 2, and 

(b) selecting compounds having the activity to bind to the protein according to daim 1 or 2. 

13. A method for screening compounds having cancer metastasis inhibitory ability, conprising steps of: 

(a) contacting test samples with cells expressing the protein according to claim 1 or 2, 

(b) detecting the expression level of the protein according to daim 1 or 2 in cells contaded with test samples 
using the antitxidy set forth in daim 1 1 , and 

(c) selecting a compound which reduces the expression level of the protein according to claim 1 or 2 as com- 
pared with that in cells not contaded with test samples. 
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Figure 1 
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Figure 2 
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Figure 3 
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